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The design, synthesis, and use of cyclic compounds, including cyclic polyamides, is described. Such compounds comprise at least 
two polymer portions, one of which comprises at least three molecular units, and the other comprises at least four molecular units. At least 
one molecular unit of such a compound is a hydrogen bond donor or acceptor. The polymer portions are covalently linked to form a cycle. 
These compounds are capable of targeting specific nucleotide sequences in double-stranded nucleic acids, particularly double-stranded 
DNA. Accordingly, such compounds can be used to modulate, e.g., increase or decrease, the expression of one or more genes in vitro or 
in vivo. 
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SPECIFICATION 

COMPOSITIONS AND METHODS RELATING TO CYCLIC COMPOUNDS 
THAT UNDERGO NUCLEOTIDE BASE PAIR- SPECIFIC INTERACTIONS 
WITH DOUBLE -STRANDED NUCLEIC ACIDS 

U.S. GOVERNMENT RIGHTS 
The U.S. Government has certain rights in this 
invention pursuant to Grant No. GM-2 7681 awarded by the 
National Institutes of Health. 

FIELD OF THE INVENTION 
This invention relates to cyclic compounds that undergo 
nucleotide base pair-specific interactions with double 
stranded nucleic acids. The invention also concerns methods 
of using such cyclic compounds, as well as methods relating 
to their solid state synthesis. 

BACKGROUND OF THE INVENTION 
None of the following discussion of the background of 
the invention, which is provided solely to aid the reader in 
understanding the invention, is admitted to be or to 
describe prior art to the invention. 

The design of synthetic ligands capable of reading 
information stored in the DNA double helix has been a long- 
standing goal of chemistry and molecular biology. Cell- 
permeable small molecules, which target predetermined 
nucleotide sequences in double -stranded nucleic acids, 
particularly double- stranded DNA (dsDNA) , are useful in 
regulating, or modulating, gene -express ion. 
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Oligodeoxynucleotides that recognize the major groove of 
double-helical DNA via triple-helix formation bind to a 
broad range of sequences with high affinity and specificity. 
See Moser, et al . , Science , vol . 238:645-650 (1987), 
Duvalvalentin, et aL , Proc. Nat'l Acad. Sci . USA , vol . 
89:504-508 (1992), Maher, et al . , Biochemistry , vol . 31:70- 
81 (1992) . Although oligonucleotides and their analogs have 
been shown to interfere with gene expression, the triple 
helix approach so far has been limited to purine tracks and 
suffers from poor cellular uptake. 

Another recent approach to targeting specific 
nucleotide sequences in dsDNA has involved molecules known 
as u polyamides . " See USSN 08/607 , 078 , PCT/US97/03332 , USSN 
08/837,524, USSN 08/853,525, PCT/US97/12733 , USSN 
08/853 , 522 , PCT/US97/12722 , PCT/US98/06997 , PCT/US98/02444 , 
PCT/US98/02684, PCT/US98/01006 , PCT/IUS98/03 829 , and 
PCT/US98/0714 . As described in the foregoing references, 
polyamides comprise polymers of amino acids covalently 
linked by amide bonds. Preferably, the amino acids used to 
form these polymers include N-methylpyrrole (Py) and N- 
methylimidazole (Im) . 

Wade, et al . ( J. Am. Chem. Soc. , vol. 114:8783-8794 
(1992)) reported the design of polyamides that bind in the 
minor groove of dsDNA at 5 ' - (A, T) G (A, T) C (A, T) -3 ' sequences 
by a dimeric, side-by-side motif; Mrksich, et al . ( Proc . 
Natl. Acad. Sci. USA , vol. 89:7586-7590 (1992)) reported an 
antiparallel, side-by-side polyamide motif for sequence- 
specific recognition in the minor groove of dsDNA by the 
designed peptide l-methylimidazole-2-carboxamide netropsin; 
and Trauger, et al . (Nature, vol. 382:559-561 (1996)) 
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reported the recognition of a targeted dsDNA by a polyamide 
at subnanomolar concentrations. The particular order of 
amino acids in such polyamides, and their pairing in 
dimeric, antiparallel complexes formed by association of two 
5 polyamide polymers, determines the sequence of nucleotides 
in dsDNA with which the polymers preferably associate. 

The development of pairing rules for minor groove 
binding polyamides derived from N-methylpyrrole (Py) and N- 
methylimidazole (Im) amino acids provided a useful code to 

10 control target nucleotide base pair sequence specificity. 

Specifically, an Im/Py pair in adjacent polymers was fond to 
distinguish G»C from OG and both of these from A»T or T»A 
base pairs. A Py/Py pair was found to specify A«T from G*C 
but could not distinguish A»T from T»A. White, et al . 

15 ( Biochemistry , vol. 35:12532-12537 (1996)) reported the 
effects of the A»T/T»A degeneracy of Py/Im polyamide 
recognition in the minor groove of dsDNA . White, et al . 
( Chem. & Biol, vol . 4:569-578 (1997)) reported the pairing 
rules for recognition in the minor groove of dsDNA by Py/Im 

20 polyamides and the 5'->3', N-»C orientation preference for 
polyamide binding in the minor groove of dsDNA. 

More recently, it has been discovered that inclusion of 
a new aromatic amino acid, 3-hydroxy-N-methylpyrrole 
(Hp) (made by replacing a single hydrogen atom in Py with a 

25 hydroxy group) , in a polyamide and paired opposite Py 
enables A*T to be discriminated from T»A by an order of 
magnitude. Utilizing Hp together with Py and Im in 
polyamides provides a code to distinguish all four Watson- 
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Crick base pairs ( i.e. , A»T, T»A, G»C, and C#G) in the minor 
groove of dsDNA, as described in Table 1. 



Table 


1: Pairing Code for Minor Groove 




Recognition 


Pair 


G»C C»G T*A A»T 


Im/Py 


+ 


Py/Im 


+ 


Hp/Py 


+ 


Py/Hp 


+ 


Favored (+) , disfavored (-) 



As discussed above, a number of different polyamide 
motifs have been reported in the literature, including 
"hairpins, " "H-pins, " "over lapped, " "slipped, " and 

10 "extended" polyamide motifs. Specifically, hairpin 

polyamides are those wherein the carboxy terminus of one 
amino acid polymer is linked via a linker molecule, 
typically aminobutyric acid or a derivative thereof to the 
amino terminus of the second polymer portion of the 

15 polyamide. Indeed, the linker amino acid y-aminobutyric acid 
(y) , when used to connect first and second polyamide polymer 
portions, or polyamide subunits, C->N in a "hairpin motif , " 
enables construction of polyamides that bind to 
predetermined target sites in dsDNA with more than 100 -fold 

20 enhanced affinity relative to unlinked polyamide subunits. 
See Trauger, et al. , Nature, vol. 382:559-561 (1996), 
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Swalley, et al . . J . Am . Chem . Soc . , vol . 119:6953-6961 
(1997), Turner, et al . , J. Am. Chem, Soc. , vol. 119:7636- 
7644 (1997) , Trauger, et al . , Ancrew. Chemie. Int. Ed. Eng. , 
vol. 37:1421-1423 (1997), Turner, et al . , J. Am. Chem. Soc. , 
5 vol. 120:6219-6226 (1998), Kelly, et al . , Proc. Nat 1 1 Acad. 
Sci. USA , vol. 93:6981-6985 (1996), Trauger, et ai, t J. Am. 
Chem . Soc . , vol. 118:6160-6166 (1996), Geierstanger , et al . , 
Nature Struct. Biol. , vol. 3:321-324 (1996), Swalley, et 
al . , Chem . Eur . J. , vol . 3:1600-1607 (1997), and Trauger, et 

10 al^., J . Am . Chem . Soc . , vol . 120:3534-3535 (1998). 

Moreover, eight-ring hairpin polyamides (comprised of two 
four amino acid polymer portions linked C->N) have been found 
to regulate transcription and permeate a variety of cell 
types in culture. See Gottesfield, J. M. ; et al . , Nature , 

15 vol. 387:202-205 (1997). 

An H-pin polyamide motif, i.e. , wherein two paired, • 
antiparallel polyamide subunits are linked by a linker 
covalently attached to an internal polyamide pair, have also 
been reported. Another polyamide motif that can be formed 

20 between linked or unlinked polyamide subunits is an 

"extended'' motif, wherein one of the polyamide subunits 
comprises more amino acids than the other, and thus has a 
single -stranded region. See USSN 08/607,078. In contrast, 
an "overlapped" polyamide is one wherein the antiparallel 

25 polyamide subunits completely overlap, whereas in a 

"slipped" binding motif, the two subunits overlap only 
partially, with the C-terminal portions not associating with 
the N- terminal regions of the other subunit . See USSN 
08/607, 078. 
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The literature has also reported the synthesis of a 
six-ring Py/lm-containing cyclic polyamide (cyclo- (Im-Py-Py- 
y-Py-Py-Py-y) ) that bound its designated five base pair (bp) 
dsDNA target sequence ( 5 ' -TGTTA-3 ' ) at subnanomolar 
concentrations, and with 40- fold higher affinity relative to 
a hairpin analog (Im-Py-Py-y-Py-Py-Py-Dp) containing the same 
amino acid pairings. See Cho, et al . . Proc . Nat ' 1 Acad. 
Sci. USA , vol . 92:10389-10392 (1995). It was postulated 
that closing the ends of the hairpin to form a cyclic 
compound would restrict conformational space for the DNA- 
binding molecule. Significantly, however, the hairpin 
analog more strongly bound to its match sequence versus 
single base pair mismatch sites by a factor of 20, whereas 
the cyclic polyamide bound match versus mis-matched sites 
with only 4 -17-fold specificity, suggesting that an 
energetic price was paid by forming a cyclic molecule which 
had more restricted conformational flexibility as compared 
to non-cyclic polyamide motifs. Because of the discouraging 
thermodynamics of cyclic polyamides with regard to sequence 
specificity, cyclic polyamides appear to not have been 
investigated further, although the solid state synthesis of 
an 8-ring cyclic polyamide, cyclo- (Im-Py- Im-Py-y-Im-Py- Im-Py- 
(G-Dp) -y) was reported in PCT/97US/03332 , as were the 
sequences of two apparently prophetic 8 -ring cyclic 
polyamides. These sequences were cyclo (Im-Py- Im-Py-y-Im-Py- 
Im-Py- (G-Dp) ) and cyclo ( Im-Py-Py-Py-y- Im-Py- Py-Py- (G-Dp) ) . 
No biochemical characterization, e.g. , target sequence 
binding affinity or sequence specificity, was provided for 
any of these three cyclic polyamides. 
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SUMMARY OF THE INVENTION 
The present invention is based on the surprising and 
unexpected discovery of new and useful cyclic compounds, 
including cyclic polyamides, that interact in a nucleotide 
5 base pair-specific manner with desired target sequences in 
double- stranded nucleic acid molecules, particularly dsDNA, 
with affinities and specificities comparable to or better 
than naturally occurring DNA-binding proteins. 

One aspect of the invention concerns cyclic compounds 

10 having molecular structures under physiological conditions 

that enable nucleotide base pair-specific interactions to be 
formed with high affinity and specificity with double- 
stranded nucleic acid molecules. Preferably, such molecules 
have an affinity for a specific sequence of nucleotide base 

15 pairs that is less than about 100 nM, preferably less than 
about 10 nM, and even more preferably, less than about 1 nM, 
as measured by DNase footprint titration. See 
PCT/US97/03332 . Similarly, such compounds have a sequence 
specificity, or prefer to bind to one sequence, the target 

20 sequence, by a factor of greater than about 2, preferably 

greater than about 5, and even more preferably greater than 
about 10, as compared to a sequence that differs by one 
nucleotide base pair. 

Another aspect of the invention concerns cyclic 

25 compounds that contain at least seven units, at least one of 
which is a hydrogen bond donor or acceptor, configured in a 
cycle to form a molecular structure that interacts with 
specific nucleotide base pairs in a double -stranded nucleic 
acid molecule under physiological conditions. Specifically 

30 excluded from the cyclic compounds according to the 
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invention are three cyclic polyamides having the following 
amino acid sequences: cyclo- ( Im-Py- Im-Py-y-Im-Py-Im-Py- (G- 
Dp) -y) , cyclo (Im-Py-Im-Py-y-Im-Py-Im-Py- (G-Dp) ) , and 
cyclo (Im-Py-Py-Py-y-Im-Py-Py-Py- (G-Dp) ) . 

A "cycle" is a compound composed of covalently linked 
independent molecular units at least one of which is 
covalently linked directly ( i.e. , without a linker) or 
indirectly (i.e., through a linker) to at least two other 
independent molecular units of the cycle. A u covalent 
linkage" refers to a chemical bond involving the sharing of 
one or more electrons between two or more atoms, and an 
"independent molecular unit" or "unit" refers to a 
previously synthesized molecule that is incorporated into 
the cyclic compound during the latter compound's synthesis. 
Preferred independent molecular units include, but are not 
limited to, molecules that are hydrogen bond donors or 
hydrogen bond acceptors, or molecules which become hydrogen 
bond donors or acceptors following incorporation into a 
cyclic compound according to the invention. Other preferred 
molecular units include amino acids, nucleosides, and 
carbohydrates . 

A "hydrogen bond donor" refers to a molecule or group 
of atoms including a hydrogen atom covalently linked to one 
electronegatively charged atom such that the hydrogen atom 
becomes electropositively charged and can thus be 
electrostatically attracted to interact with a second 
electronegative atom or group of atoms (in either case, the 
hydrogen bond acceptor) . Representative hydrogen bond 
donors include a hydrogen covalently bonded to a nitrogen in 
an amide bond, and the 2 -amino group of guanine. 
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Representative hydrogen bond acceptors include the N3 of 
purines and Im, and the 02 of pyrimidines in dsDNA. 

A ''nucleotide- specif ic interaction" refers to a non- 
covalent interaction between a hydrogen bond donor or 
5 acceptor of a cyclic compound and a hydrogen bond acceptor 
or donor of nucleotide base in a double -stranded nucleic 
acid to form at least one hydrogen bond between a hydrogen 
bond donor or acceptor of a cyclic compound according to the 
invention with at least one of the nucleotides of a Watson- 

10 Crick base pair in a double- stranded nucleic acid. 

Preferably, at least one hydrogen bond is formed between 
each donor/donor or donor/acceptor pair of a cyclic compound 
and the adjacent nucleotide pair in the target sequence of 
the double -stranded nucleic acid, particularly dsDNA. A 

15 "donor/donor pair" or a Monor/acceptor pair" refers to a 
pair of hydrogen bond donors, or a hydrogen bond donor and 
acceptor, of a cyclic compound which reside on different 
polymer portions thereof but which are located proximate to 
one another in the cyclic compound such that each member of 

20 the pair interacts with a different member of the Watson- 
Crick nucleotide base pair with which one or both members of 
the donor /donor or donor/acceptor pair form one or more 
hydrogen bonds . 

In preferred embodiments, a "cyclic compound" refers to 

25 a compound that has two or more polymer portions or 

subunits, each of which polymer portions or subunits itself 
comprises multiple units at least one of which is a hydrogen 
bond donor or acceptor. In these embodiments, at a minimum 
the cyclic compound comprises two polymer portions. The 

30 first polymer portion comprises a polymer of at least three 
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units, one or more of which acts as either a hydrogen bond 
donor or hydrogen bond acceptor. The second polymer portion 
comprises a polymer of at least four units, at least one of 
which also acts as either a hydrogen bond donor or hydrogen 
5 bond acceptor. The units which act as hydrogen bond or 

acceptors in either of the first or second polymers portions 
may be covalently linked to one another in a particular 
polymer, or they be separated by one or more intervening 
moieties, for example, a moiety which serves to restore the 

10 binding register to the cyclic compound. "Binding register" 
refers to the alignment of hydrogen bond donors and 
acceptors of a polymer portion with their respective 
hydrogen bond acceptors and donors on one strand of a 
double -stranded nucleic acid. Preferred examples of such 

15 register restoring moieties include p-alanine ("P" ), 

glycine, and other aliphatic chains which provide proper 
spacing between flanking hydrogen bond donors and/or 
acceptors of a polymer portion so as to enable hydrogen bond 
formation with corresponding hydrogen bond acceptors and 

20 donors of a strand of the double-stranded target nucleic 
acid. 

To form a cyclic compound, the first and second polymer 
portions are attached at two or more locations, directly by 
covalent bonds between the polymers themselves, or 

25 indirectly, such as through the use of one or more linker 
molecules, or by a combination of direct and indirect 
attachment. As those in the art will appreciate, the 
resulting cyclic molecule may be symmetrical, in that each 
of the units ( e.g. , a hydrogen bond donors or acceptor) of 

30 one polymer portion align with a unit ( e.g. , a hydrogen bond 
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donor or acceptor) of the other polymer portion. 
Alternatively, the cyclic compound may be asymmetrical, 
wherein at least one unit of one polymer portion does not 
align with a unit of the other polymer portion under 
5 physiological conditions in the presence of dsDNA. 

In certain preferred embodiments of this aspect of the 
invention, at least one of the units, e .g. , a hydrogen bond 
donor or acceptor, is an amino acid. An "amino acid" refers 
to any naturally occurring or synthetic molecule having an 

10 amino group, a carboxyl group, and an R group attached to 
the same or a different carbon atom. Representative 
examples of amino acids useful in the practice of this 
invention include the L- and D- forms of the twenty amino 
acids assembled into proteins in animals and plants, other 

15 aliphatic and aromatic amino acids, including pyrrole, 
hydroxypyrrole, and imidazole, and any chemical 
modifications or derivatives of any of the foregoing. Amino 
acids may serve as hydrogen bond donors ( e.g. , Py) , hydrogen 
bond acceptors ( e.g. , Im) , as register restoring moieties 

20 (also referred to herein as "springs"), or as linkers. 
Particularly preferred for hydrogen bond formation are 
aromatic amino acids. An "aromatic amino acid" is one 
wherein the R group comprises a ring having four or more 
members, preferably 5 or 6 members. In particularly 

25 preferred embodiments, the members of the ring members are 
independently selected from the group consisting of carbon, 
nitrogen, sulfur, and oxygen. 

In other preferred embodiments, at least two of the 
hydrogen bond donors and/or acceptors of a polymer portion 

30 are covalently attached by an amide bond. Such cyclic 
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compounds are referred to herein as cyclic polyamides. In 
certain particularly preferred embodiments, the first and 
second polymer portions comprise the same number of units, 
wherein a "unit" is a molecule that provides proper 
5 nucleotide-to-nucleotide spacing for a polymer of such units 
such that a hydrogen bond- stabilized non-covalent complex 
can be formed under physiological conditions between 
hydrogen bond donors and/or acceptors in the polymer and 
targeted hydrogen acceptors and/or donors in an adjacent 

10 nucleic acid strand of a double- stranded nucleic acid. 

Clearly, not all units within a polymer need to be hydrogen 
bond donors or acceptors, but each unit should provide 
spacing such that desired hydrogen bond formation can occur 
between specific hydrogen bond donors and/or acceptors in 

15 the polymer portion and corresponding hydrogen bond 

acceptors and/or donors in the nucleic acid strand with 
which the polymer forms a non-covalent complex. However, 
the polymer must contain a sufficient number of such units 
that are hydrogen bond donors or acceptors to enable 

20 hydrogen bond-stabilized non-covalent complex formation 

between a polymer of such units and a strand of a double- 
stranded nucleic acid. 

The polymer portions, when linked to form cyclic 
compounds, may exhibit symmetrical motifs ( e.g. , an 

25 "overlapped" motif) or asymmetrical motifs ( e.g. , "slipped" 
or "extended" motifs) . In preferred embodiments, the 
polymer portions each comprise the same number of units, and 
may or may not contain the same number of hydrogen bond 
donors or acceptors. In particularly preferred embodiments, 
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each polymer portion comprises 4, 5, 6, 7, 8, 9, 10, 11, 12, 
13, 14, 15, 16, 17, 18, 19, or 20 units. 

Certain preferred embodiments of this aspect of the 
invention concern polyamides, wherein each unit of the first 
5 and second polymer portions is an amino acid, preferably 
linked by amide bonds. Particularly preferred are those 
wherein each polymer portion comprises 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acids, some, 
and preferably all, of which are selected from the group 

10 consisting of Py, Hp, Im, p, and glycine. In some of these 
embodiments, the first and second polymer portions comprise 
equal numbers of amino acids, the particular order of which 
determines the nucleotide base pairs with which the cyclic 
compound will preferentially interact. When each polymer 

15 portion comprise an equal number of amino acids, a cyclic 
polyamide may have an "overlapped" or "slipped" binding 
motif. In other embodiments, each polymer portion comprises 
a different number of amino acids. Thus, the cyclic 
compound will typically have an "extended" binding motif. 

20 In many embodiments, the invention concerns cyclic 

compounds wherein each unit of the first polymer portion 
aligns with a unit of the second polymer portion to form a 
unit pair that preferably associates under physiological 
conditions with at least one, but not all, nucleotide base 

25 pairs selected from the group consisting of A/T, T/A, G/C, 

or C/G. A "unit pair" refers to two units, one from each of 
two different polymer portions, the align in a cyclic 
compound such that upon interaction with a double -stranded 
nucleic acid, each unit of the pair interacts (or if no 

30 interaction, aligns) with a different nucleotide in a 
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Watson- Crick nucleotide base pair in a double -stranded 
nucleic acid. For example, if the unit pair interacts at 
A/T base pair, one unit of the unit pair interacts or 
otherwise aligns with A and the other unit aligns with T. 
Preferably, the unit pair preferably associates with at 
least one, but not all, of the four Watson-Crick base pairs 
(A/T, T/A, G/C, and C/G in dsDNA, and A/U, U/A, G/C, and C/G 
in RNA:DNA and RNA: RNA duplexes), thereby providing some 
level of sequence specificity. For example, it is known 
that Py/Py unit pairs prefer A/T or T/A base pairs in dsDNA, 
but not G/C or C/G base pairs. Particularly preferred are 
unit pairs that prefer one nucleotide base pair to the other 
three possible in a particular type double -stranded nucleic 
acid. For example, a Im/Py unit pair prefers G/C to the 
other three base pairs, whereas a Py/Im unit pair prefers 
C/G above the other three possibilities. 

"Physiological conditions" refer to conditions found in 
vivo, or, alternatively, to in vitro reaction conditions 
intended to mimic or approximate those found in vivo . With 
regard to in vitro conditions which seek to approximate in 
vivo conditions, consideration should be given to pH, salt 
concentrations, buffering capacity, temperature, and such 
other parameters as may be deeded necessary in the 
particular circumstance. For purposes of illustration, 
physiological conditions in the context of interactions 
between a cyclic compound according to the invention and 
dsDNA include those which are suitable for performing in 
vitro experiments which exploit the in vivo activity of an 
enzyme, for example, a DNA polymerase, an RNA polymerase, or 
an enzyme which modifies nucleic acids. Preferably, such 
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enzymes will have at least 50%, more preferably, at least 
about 75%, and even more preferably, at least about 90%, of 
their optimal activity under the experimental conditions 
selected. As those in the art will appreciate, what 
5 constitutes physiological conditions in a given situation 
may depend on many factors, such as the type of organism 
being considered, the environment inhabited by the organism, 
etc . 

In various embodiments of this aspect of the invention, 

10 a cyclic compound comprises at least one linker molecule. 
When used herein, "linker" refers to a molecule used to 
covalently link two polymer portions of a cyclic compound. 
Preferably, two linkers are employed, although it may be 
desirable to include 1, 2, 3, or more additional linkers. 

15 Preferred linkers are those which comprise an aliphatic 

chain, for example, P, glycine, and aminobutyric acid, with 
the latter being particularly preferred. Preferred cyclic 
compounds comprise two linkers comprising aliphatic chains, 
especially aminobutyric acid. Particularly preferred 

20 aminobutyric acids include y- aminobutyric acid, particularly 
substituted derivatives thereof, for example, (R)-2,4- 
diaminobutyric acid. Linkers may be used to link terminal 
units of polymer portions to one another, or a terminal unit 
of one polymer to an internal unit of another polymer 

25 portion. Alternatively, linkers may be used to join two 

polymer portions by covalent linkages between internal units 
of each polymer portion. 

Another aspect of the invention concerns a cyclic 
compound functionally associated (preferably by a covalent 

30 linkage) with an independent compound. Independent 
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compounds can be any compound, with other cyclic compounds, 
proteins, nucleic acids, and polyamides ( e.g. , a polyamide 
molecule having a hairpin, H-pin, extended, overlapped, or 
slipped polyamide motif) being preferred. 
5 The cyclic compounds of the invention interact with 

double -stranded nucleic acid with varying degrees of binding 
affinity and sequence specificity, depending on various 
factors, including compound size and composition, the 
targeted nucleotide base pair sequence, nucleic acid type, 

10 etc. Preferably, a cyclic compound will interact with its 
target nucleotide base pair sequence with an affinity, as 
measured by DNase footprint titration, of less than about 
100 nM, preferably less than about 10 nM, more preferably 
less than about 1.0 nM, even more preferably less than about 

15 0.1 nM. With regard to specificity (as measured in vitro 

under physiological conditions by comparing binding between 
a compound's intended target sequence, i.e. , a "match" site, 
and a test site equivalent to the target sequence except for 
a one nucleotide base pair difference, i.e. , a "single base 

20 pair mismatch" site) , a cyclic compound of the invention 
should have at least about two- fold, preferably about 3-5 
fold, more preferably 5-10-fold, and even more preferably 
greater than about 10-fold specificity for its match versus 
single base pair mismatch site. 

25 When the double- stranded nucleic acid is dsDNA, cyclic 

compounds according to the invention may target specific 
molecular structures, for example, the major or minor 
groove, associated with a target nucleic acid sequence. 
Preferred cyclic compounds are those which interact with 

30 structures in the minor groove of dsDNA in the B-form. 
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Cyclic polyamide compounds represent a particularly 
preferred class of such compounds. 

Structurally, the cyclic compounds of the invention can 
be envisioned as comprising first and second polymer 
5 portions having the following formulas: 

First polymer portion: 



10 
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- z, - 
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- z 2 - 


x 3 
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15 



wherein a >, 0 and b x , b 2 and b 3 _> 1, x x , x 2 , x 3/ and x 4 
are units, one 'or more of which serve as hydrogen bond 
donors or acceptors, and one or more of which may be the 
same or different molecules, and z lf z 2/ and z 3 each is a 
covalent linkage between adjacent units. 

Second polymer portion: 
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-Z' 2 - 


Y 3 


- Z' 4 - 


Y 4 
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wherein Ci, c 2/ c 3 and c 4 > 1, yi, y 2 , y 3/ and y 4 each are 
units, one or more of which serve as hydrogen bond donors or 
acceptors, and one or more of which may be the same or 
25 different molecules, and z'i, z' 2 , and z' 3 each is a covalent 
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linkage between adjacent units. In preferred embodiments, 
x 2 , x 3 , and x 4 and y 2 , y 3/ and y 4 form unit pairs that each 
preferably interact with at least one, but not all, Watson- 
Crick nucleotide base pairs, most preferably with only one 
5 such nucleotide base pair. The first and second polymer 
portions also typically comprise two termini. When the 
cyclic compound is a cyclic polyamide, one such terminus is 
an amino terminus, the other is a carboxy terminus. In 
particularly preferred embodiments, when contained in a 

10 cyclic polyamide, the polymer portions are arrayed in an 
ant i -parallel manner. 

Another aspect of the invention concerns pharmaceutical 
compositions comprising a cyclic compound according to the 
invention and a pharmaceutically acceptable carrier. Such 

15 compositions can be formulated into liquid or solid forms, 
and can be delivered by any appropriate route of 
administration. Preferred liquid formulations are aqueous 
solutions, suspension, slurries, gels, and emulsions. 
Preferred solid forms include pills, capsules, and powders 

20 (including lyophilized powders) . Preferred routes of 
administration include parenteral ( e.g. , subcutaneous, 
intramuscular, intravenous, and interperitoneal) injection, 
transdermal delivery, inhalation, and oral delivery. Such 
compositions may also include additional components, for 

25 example, one or more other biologically active ingredients, 
liposomes (such as may be formed from various cationic 
lipids) , etc . Such compositions may also be able to target 
delivery of a cyclic compound according to the invention to 
one or more particular cell or tissue types, or, with 

30 respect to eukaryotic cells, to an intracellular compartment 
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containing double -stranded nucleic acid, e.g. , a cell's 
nucleus (such as by complexing the cycle with a nuclear 
localization signal) . 

Other aspects relate to methods of using the compounds 
5 according to the invention. One area of application relates 
to the modulation of gene expression. "Modulation" refers 
to activating, increasing, enhancing, derepressing, 
reducing, decreasing, inhibiting, or preventing expression 
of a gene. Thus, some cyclic compounds positively affect, 

10 or up-regulate, gene expression, while others negatively 

affect, or down regulate, gene expression. A n gene" refers 
to genetic locus that encodes one or more gene products. As 
those in the art will appreciate, a gene can encode more 
than one gene product by virtue of differential mRNA 

15 splicing. Gene products include proteins ( e.g. , enzymes, 

receptors, antibodies, growth factors, and hormones) and RNA 
molecules, particularly tRNAs, ribosomal RNAs and other RNAs 
which are subunits of multi- component complexes ( e.g. , 
telomerase) , and catalytic RNAs ( e.g. , ribozymes) . To 

20 achieve the desired level of modulation in systems 

comprising cells, it is necessary to deliver a sufficient 
quantity of a cyclic compound according to the invention to 
the cells. 

In some embodiments, the use of cyclic compounds 
25 according to the invention can modulate the expression of 
more than one gene. For example, more than one cyclic 
compound, each of which specifically modulates a particular 
gene, can be delivered. Alternatively, the cyclic compound 
may directly influence the expression of more than one gene. 
30 For example, if the cyclic compound targets a nucleotide 
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base pair sequence found in a regulatory region of more than 
one gene, modulation of expression of multiple genes may 
occur. Alternatively, the cyclic compound may exhibit its 
expression-modulating effects indirectly, or by a 
combination of direct and indirect effects. For instance, 
if the cycle inhibits expression of a phosphatase that 
removes phosphates from a plurality of. proteins, the 
expression of genes regulated by pathways that involve the 
phosphatase will be affected. 

Certain embodiments of this aspect concern modulation 
of gene expression in vitro . " In vitro " includes both in 
situ and cell -free environments ( e.g. , a cell extract or in 
a well-defined reaction medium) . Thus, the compounds of the 
invention can be used to modulate gene expression in 
cultured cells, such as may be used in ex vivo therapy or 
research. 

Other embodiments of this aspect relate to the 
modulation of gene expression in vivo , some of which concern 
therapeutic purposes. As used herein, a "therapeutic 
purpose" includes both therapy ( i.e. , treatment of an 
existing condition) and prophylaxis ( i.e. , prevention) . 
Representative examples of a therapeutic purpose include 
treatment of a disease associated with aberrant expression 
of the gene of interest (as occurs in certain cancers and 
genetic diseases, for example) , as well as treatment of a 
disease associated with the presence of a pathogen ( e.g. . a 
virus or a bacterial or eukaryotic pathogenic organism) . 

Cyclic compounds according to the invention can be used 
for therapeutic purposes in conjunction with a vast array of 
organisms, including both animals and plants. Preferred 
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animals amenable to application of the therapeutic and 
prophylactic methods herein described include animals of 
agricultural importance, for example, avian (particularly 
poultry) , bovine, equine, ovine, and porcine animals, 
companion animals such as dogs and cats, and humans. With 
regard to plants, preferred plants include those of 
agricultural importance, including cereals, grains, and 
grasses. Similarly, cyclic compounds according to the 
invention can be developed to control pests, e.g. , certain 
insects and rodents. 

Yet other aspects of the invention concerns methods for 
the solid phase synthesis of one or more cyclic polyamides. 
Such methods comprise providing a solid support ( e.g. , a 
polystyrene resin) , protecting and activating the 
appropriate carboxamide monomers and/or dimers, sequentially 
adding the carboxamide monomers and/or dimers to the solid 
support, beginning with the carboxy terminal carboxamide 
residue, deprotecting the carboxamides following formation 
of the desired polyamide, releasing the polyamide from the 
solid support, and cyclizing the polyamide to form a cyclic 
polyamide. The resulting cyclic polyamide may then be 
purified, e.g. , by reverse phase HPLC. Preferably, such 
purification results in a cyclic compound that has a purity 
of more than about 90%, preferably more than about 95%, 
particularly more than about 99%. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 (top panel, a-c) shows hydrogen bonding models 
of the polyamide -dsDNA complexes formed between the 2:1 
dimer of the polyamide Im-Py-Py-Py-p~Dp (1), the 1:1 hairpin 



WO 00/40605 



PCT/US00/00298 



22 



Im-Py-Py-Py- (£) H2N y-Im-Py-Py-Py -OH (2), and the 1:1 cyclic 
polyamide cyclo- (y-Im-Py-Py-Py- (R) H2N y-Im-Py-Py-Py) (3) with 
its 6 bp match site , 5 1 -AGTACT-3 1 . Circles with dots 
represent lone pairs of the N3 of purines and the 02 of 
5 pyrimidines in the dsDNA. Circles containing an H represent 
the N2 hydrogen of guanine. Predicted hydrogen bonds are 
illustrated by dotted lines. 

Figure 2 shows the structures of the eight -ring 
polyamides Im~Py-Py-Py-p-Dp (1), Im-Py-Py-Py- (R) H2N y-Im-Py-Py- 

10 Py-P-Dp (2), Im-Py-Py-Py- (#) H2N y-Im-Py-Py-Py-OH (2), and 

cyclo- (y-Im-Py-Py-Py- (i?) H2N y-Im-Py-Py-Py) (3), as synthesized 
by solid phase methods. 

Figure 3 depicts the synthesis of Boc-Py-PAM-resin (7) 
through performance of the following steps: (i) K 2 C0 3/ DMF; 

15 (ii) Zn/AcOOH, (iii) DCC/HOBT, DMF; (iv) aminomethylated- 
polys tyrene ; DIEA . 

Figure 4 shows four monomers, Boc -pyrrole -OBt ester 
(8) (Boc-Py-OBt) , imidazole-2-Carboxylic acid 2a (Boc-Im 
acid) (9) , (R) -Fmoc-a-Boc-y-diaminobutyric acid (10), and 

20 CBZ-y-aminobutyric acid-imidazole dimer (Cbz-y-Im acid) (11) 
( see box on Fig. 4B) for synthesis of cyclic polyamides 
described herein according to the following protocol: (i) 
1:4 DMF : Piperidine (22°C, 30 min) : (ii) LiBH4, EtOH, reflux 
16 hr. (iii) 80 % TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, 

25 DIEA, DMF; (v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-Py-OBt, 

DIEA, DMF; (vii) 80% TFA/DCM, 0.4 M PhSH; (viii) Boc-Im acid 
(DCC, HOBT) for 3, Boc-Py-OBt, DIEA, DMF for 4; (ix) 80% 
TFA/DCM, 0.4 M PhSH; (x) {R) -Fmoc-a-Boc-y-diaminobutyric 
acid (HBTU, DIEA); (xi) 80% TFA/DCM, 0.4 M PhSH; (xii) Boc- 
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Py-OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) 
Boc-Py-OBt, DIEA, DMF; (xv) 80% TFA/DCM, 0.4 M PhSH; (xvi) 
Boc-Py-OBt, DIEA, DMF; (xvii) 80% TFA/DCM, 0.4 M PhSH; 
(xviii) Cbz-y-Im acid (HBTU, DIEA); (xix) 80% Piperdine : DMF 
5 (25°C / 30 min) ; (xx) Boc anhydride, DIEA, DMF; (xxi) 
Pd(0Ac) 2 , HC0 2 NH 4 , DMF (37°C, 8 hr) ; (xxii) DPPA, K 2 C0 3 
(xxiii) TFA (1 hr) . 

Figure 5 illustrates the 229 bp restriction fragment 
with the relative positions of the match and single base 

10 pair mis-match sequences indicated for two 8-ring cyclic 

polyamides, cyclo- (y-Im-Py-Py-Py- (R) Wlm-Py-Py-Py) (3) and 
cyclo- (y-Im-Py-Py-Py- (R) H 2 N y-Py-Py-Py-Py) (4) . . MPE*Fe(II) 
protection patterns of 10 nM cyclo- (y- Im-Py-Py-Py- (jR) H 2 N y- Im- 
Py-Py-Py) (3) and 10 nM cyclo- (y- Im-Py-Py-Py- {R) H 2 N y-Py-Py-Py- 

15 Py) (4) are shown by the bar graphs. Bar heights are 

proportional to the relative protection from cleavage at 
each base pair. Binding sites determined by MPE*Fe(II) 
footprinting and quantitated by DNase I footprint titrations 
are boxed. Autoradiograms of MPE*Fe(II) footprinting 

20 experiments using cyclo- (y-Im-Py-Py-Py- (R) H 2 N y-Im-Py-Py-Py) 
(3) (left) and cyclo- (y-Im-Py-Py-Py- (R) H 2 N y-Py-Py-Py-Py) (4) 

(right) on the 3 ! 32 P-labeled 229 bp restriction fragment of 
pJT8. 5 ■ -AGTATT-3 1 and 5 f -AGTACT-3 1 sites are shown 
adjacent to the autoradiograms. Lane 1, intact control ; 
25 lane 2, MPE»Fe(II) reaction; lane 3, A reaction standard; 

lanes 4-8: 100 pM; 1 nM; 10 nM; 100 nM; and 1 mM polyamide, 
respectively. All lanes contain 15 kcpm of either 3' or 5 1 - 
radiolabeled DNA, 25 mM HEPES buffer (pH 7.3), 200 mM NaCl, 
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50 mg/ml glycogen, 5 mM DTT, 0 . 5 mM MPE*Fe(II), and were run 
at 22°C. 

Figure 6 shows the results of affinity cleaving 
experiments using cyclo- (y-Im-Py-Py-Py- (R) EDTA - Fe(II) y-Py-Py-Py- 
Py) (4-E»Fe(II)) on the (a) 3'- and (b) 5 ' - 32 P-labeled 229 bp 
restriction fragment of pJT8 . The match and single mis- 
match sites 5 1 -AGTATT-3 1 and 5 1 -AGTACT-3 ' are shown on the 
right side of the autoradiograms . The lanes of each gel are 
described as follows: lane 1, intact DNA (control); lane 2, 
A reaction; lanes 3-7, 100 pM, 1 nM, 10 nM, and 100 nM, and 
1 \iM polyamide, respectively. All lanes contain 15 kcpm of 
either 3 1 - or 5 • -radiolabeled DNA, 20 mM HEPES buffer (pH 
7.3), 200 mM NaCl, 50 jig/ml glycogen, 5 mM DTT, 1 jiM Fe(II), 
pH 7.0, and were run at 22°C. 

Figure 7 illustrates the synthesis of cyclo- (y- Im-Py-Py- 
Py(J?) EDTA ' Fe(II, y-Py~Py>Py-Py) (4-E*Fe(II)) f rom cyclo- (y- Im- Py- 
Py-Py- (i?) H 2 N Y-Py-Py-Py-Py) (4) by: (i) EDTA dianhydride 
(DMSO/NMP, DIEA, 55°C, 15 min) ; (ii) 0. 1 M NaOH (55°C, 10 
min) . Panel (b) shows the affinity cleavage pattern for at 
100 nM concentration depicting a single binding orientation 
at the 5 1 -AGTATT-3 match site and no orientational 
preference at the 5' -AGTACT-3' mismatch site. Panel (c) 
shows a ball-and-stick model of a 4 -E*Fe (II) »5 « -AGTATT-3 1 
complex. Bar heights are proportional to the relative 
cleavage intensities at each base pair. Shaded and non- 
shaded circles denote Im and Py carboxamides, respectively. 
The boxed Fe denotes the EDTA»Fe (II) cleavage moiety. 
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Figure 8 depicts the results quantitative DNase I 
footprint titration experiments with (a) cyclo- (y-Im-Py-Py- 
Py- (£) H 2 N y-Im-Py-Py-Py) (3) and (b) cyclo- (y-Im-Py- Py-Py- 
(i?) H 2 N y-Py-Py-Py-Py) (4) on the 3' -end labeled 229-bp 

5 restriction fragment of p JT8 . Lane 1, intact DNA control ; 
lane 2, A reaction; lane 3, DNase I standard, lanes 4-13, 5 
pM, 10 pM, 20 pM, 50 pM, 100 pM, 1 nM, 2 nM, 5 nM, and 10 nM 
of the respective polyamide . The 5'-AGTACT-3 f and 5'- 
AGTATT-3 ' match and mis-match sites for each polyamide were 

10 analyzed and are shown on the right side of the respective 
autoradiograms . All reactions contained 2 0 kcpm of 
radiolabeled restriction fragment, 10 mM Tris^HCI (pH 7.0), 
10 mM KC1, 10 mM MgCl 2 , and 5 mM CaCl 2 . 

Other features and advantages of the invention will be 

15 apparent from the following description of the preferred 
embodiments thereof, and from the claims. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is based on the surprising 
discovery that synthetic cyclic compounds comprising more 
20 than six units, one or more of which of which acts as a 

hydrogen bond donor or acceptor, can be constructed so as to 
produce molecules having three dimensional structures that 
form nucleotide base-pair specific interactions with double- 
stranded nucleic acids. The design, synthesis, and use of 
25 such compounds is described in detail below. 



WO 00/40605 PCT/US00/00298 

26 

I. Cyclic Compounds 

A. Generic Description of Cyclic Compounds 

The present invention concerns cyclic compounds, or 
cycles, that interact with high affinity, e.g. , subnanomolar 
5 affinity, and specificity with specific double -stranded 
nucleotide sequences. Broadly, cycles according to the 
invention comprise at least seven covalently linked units 
( i.e. , independently synthesized molecules) at least one of 
which is covalently attached directly or indirectly to at 

10 least two other units of the compound. Nucleotide base pair 
specificity is provided by the particular three-dimensional 
structure of the particular cycle, which structure is 
dictated by the type and configuration of units incorporated 
into the compound. The unit composition and three- 

15 dimensional structure of such compounds specifies what type 
of interactions ( e.g. , hydrogen bonds) will occur with a 
specific sequence of nucleotide base pairs. 

Preferably, cyclic compounds according to the invention 
provide a molecular scaffold that directs the formation of 

20 specific electrostatic and non-electrostatic interactions 
with a target nucleotide sequence. Particularly preferred 
interactions are hydrogen bonds, which are non-covalent 
electrostatic interactions formed between a hydrogen bond 
and hydrogen bond acceptor. Representative hydrogen bond 

25 donors include amide hydrogen and the 2 -amino group of 

guanine. Representative hydrogen bond acceptors include the 
N3 of purines and the 02 of pyrimidines in dsDNA. Other 
preferred hydrogen bond donors and acceptors include amino 
acids, nucleosides, and carbohydrates. 
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Cyclic compounds according to the invention comprise at 
least one hydrogen bond donor or acceptor, and under 
physiological conditions form at least one hydrogen bond 
with a nucleotide present in a target nucleotide sequence in 
5 a double- stranded nucleic acid. Target nucleotide sequences 
comprise at least 4, and may comprise 5, 6, 7 , 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more nucleotide base 
pairs. While it is desirable to form at least one hydrogen 
bond with at least one of the nucleotides of a Watson-Crick 

10 base pair in a double -stranded target nucleotide sequence, 
it is not essential. 

As used herein, "subnanomolar affinity" means binding 
that is characterized by a dissociation constant, , of 
less than 1 nM, as measured by DNase I footprint titration. 

15 Preferably, polyamides of the present invention are 

characterized by subnanomolar binding affinity for the 
identified target DNA sequence. As used herein, the 
"selectivity" of the binding of a polyamide to a DNA 
sequence is the ratio of the dissociation constant, , as 

20 measured by DNase I footprint titration of binding the 
polyamide to a mismatch DNA sequence divided by the 
corresponding dissociation constant of the binding of the 
polyamide to the identified target DNA sequence. 
Preferably, polyamides of the present invention are 

25 characterized by a selectivity of about 5 or greater, more 
preferably a selectivity of greater that about 10. 

A representative, but non-exhaustive list, of cyclic 
compounds within the scope of the instant invention includes 
cyclic polyamides and oligosaccharides. A number of such 
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compounds, and their design and synthesis, are described 
herein. 

B. Cyclic Polyamides 

As used herein "polyamide" refers to a polymer of 
subunits chosen from the set below: 




Boc-Py-COOH Boc-lm-COOH Im-COOH 



R 3 O 

R 3 R 3 
v-amino acid 

where R 1 is Ci- 100 alkyl (preferably d- 10 alkyl such as 
methyl, ethyl, isopropyl) , Ci- 100 alkylamine (preferably Ci-i 0 
alkylamine such as ethylamine) , C^ioo alkyldiamine 
(preferably Ci_ 10 alkyldiamine such as N,N- 

dimethylpropylamine) , a Ci- 10 o alkylcarboxylate (preferably a 
Ci_ 10 alkylcarboxylate such as CH 2 COOH) , Ci-i 00 alkenyl 
(preferably C^ 10 alkenyl such as CH 2 CH=CH 2 ) , or a 
Cx-ioo alkynyl (preferably Ci_ 10 alkenyl such as CH 2 CH=CH 3 ) , or 
a Ci- 10 ()L, where L groups can be independently chosen from 
biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein, bromoacet amide , iodoacetamide , DL-ct- lipoic 



O R 3 O 

a-amino acid 0-amino acid 
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acid, acridine, ethyl red, 4- (psoraen-8-yloxy) -butyrate, 
tartaric acid, ( + ) -cc-tocopheral . 

where R 2 is chosen from H, NH 2/ OH, SH, CI, Br, F, N- 
acetyl, or N-formyl. 
5 where R 3 is H, NH 2/ OH, SH, Br, CI, F, OMe, CH 2 OH, CH 2 SH, 

CH 2 NH 2 . 

where X is chosen from N, CH, COH, CCH 3 , CNH 2 , CC1, CF. 
More specifically, "polyamide" refers to a polymer of 
comprising one or more subunits of the formula (II) , below 




10 where R 1 is chosen from H, NH 2 , SH, Cl, Br, F, N- acetyl, 

or N-f ormyl . 

where R 2 is Ci-ioo alkyl (preferably Ci_ 10 alkyl such as 
methyl, ethyl, isopropyl) , d-ioo alkylamine (preferably d- 10 
alkylamine such as ethylamine) , d-ioo alkyldiamine 

15 (preferably Ci-io alkyldiamine such as N,N- 

dimethylpropylamine) , a d_ 10 o alkylcarboxylate (preferably a 
Ci-io alkylcarboxylate such as CH 2 COOH) , Ci-ioo alkenyl 
(preferably Ci_i 0 alkenyl such as CH 2 CH=CH 2 ) , or a Ci-ioo 
alkynyl (preferably d- 10 alkynyl such as -CH 2 C^CH 3 ) , or a 

20 Ci-iooL/ where L groups can be independently chose from 
biotin, oligodeoxynucleotide, N-ethylnitrosourea, 
fluorescein, bromoacet amide, iodoacetamide, DL-a-lipoic 
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acid, acridine, ethyl red, 4- (psoraen-8-yloxy) -butyrate, 
tartaric acid, (+) -a-tocopheral . 

where R 3 is chosen from H, NH 2/ OH, SH, Br, CI, F, Ome, 
CH 2 OH, CH 2 SH, CH 2 NH 2 . 

where R 4 is -NH (CH 2 ) 0 -iooNR 5 R 6 or NH(CH 2 ) m CO NH(CH 2 ) 0 -ioo 
NR 5 R 6 or NHR 5 or NH ( CH 2 ) m CONHR 5 . Where R 5 and R 6 are 
independently chosen from H, CI, NO, N-acetyl, benzyl, Ci-i 00 
alkyl, Ci-xoo alkylamine, C^qq alkyldiamine , Ci-i O0 
alkylcarboxylate, d-ioo alkenyl, a Ci- 100 alkynyl . Where m is 
an integer value ranging from 0 to 12. 

where X and Y are chosen from the following, N, CH, 
COH, CCH 3 , CNH 2 , CC1, CF. 

a is an integer chosen from values of 0 or 1 

b is an integer chosen integer values ranging from 1 to 

5. 

c is an integer value ranging from 2 to 10. 

Hereinafter, N-methylpyrrolecarboxamide may be referred 
to as >A Py", N-methylimidazolecarboxamide may be referred to 
as "Im", y-aminobutyric acid may be referred to as w y" , 0- 
alanine may be referred to as "P" , glycine may be referred 
to as W G" , dimethylaminopropylamide may be referred to as 
"Dp", and ethylenediaminetetraacetic acid may be referred to 
as "EDTA" . 

C. Other Cyclic Compounds 

In addition to cyclic polyamides, the present invention 
extends to other cyclic compounds that serve as molecular 
scaffolds comprised of units, some or all of which act 
hydrogen bond donors or acceptors, that provide an array of 
hydrogen bond donors and acceptors sufficient to enable 
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specific and sensitive interaction with a particular 
sequence of nucleotides in a duplex nucleic acid molecule. 
Such compounds include arrays of nucleotides, carbohydrates, 
lipids, and other compounds known in the art. 

5 

D. Cyclic Compound Polymers 

As those in the art will appreciate, certain 
embodiments of the present invention concern cyclic compound 
polymers comprised of two or more different compounds, at 

10 least one of which is a cyclic compound according to the 
invention. Representative examples of such compounds 
include conjugates between cyclic compounds and nucleic 
acids and/or proteins or polypeptides. In a cyclic acid- 
nucleic acid conjugate, the cyclic compound is covalently 

15 attached to the nucleic acid molecule. The nucleic acid is 
preferably a single-stranded oligonucleotide, although 
double -stranded nucleic acids can also be used. On the 
other hand, cyclic compound-polypeptide conjugates relate to 
those wherein the polypeptide and cycle are associated by 

20 any linkage, preferably one that is covalent . 

E» Bi- Functional Cyclic Compounds 

The design of bifunctional molecules target specific 
nucleotide sequences in double -stranded nucleic acid 

25 typically requires the integration of two separate entities: 
recognition and functional activity. Cyclic compounds that 
specifically bind with the desired affinity ( e.g. , 
subnanomolar affinity) to a predetermined, double -stranded 
nucleic acid sequence are linked (preferably covalently, but 

30 non-covalent , high affinity interactions between two members 
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of a binding pair ( e.g. , streptavidin and biotin) to a 
functional molecule, providing a bifunctional conjugate. 
Such bifunctional molecules can be used in variety of 
contexts, including therapeutically, prophylactically , 
5 diagnostically, and in research ( e.g. . in DNA, particularly 
genomic DNA, sequencing) . 

Cyclic compounds according to this invention can be 
conjugated to a variety of functional molecules, which can 
be independently chosen from, but is not limited to, 

10 arylboronic acids, biotins, polyhistidines comprised from 
about 2 to 8 amino acids, haptens to which an antibody 
binds, solid phase supports, oligodeoxynucleotides, N- 
ethylnitrosourea, fluorescein, bromoacetamide, 
iodoacetamide, DL-a-lipoic acid, acridine, captothesin, 

15 pyrene, mitomycin, texas red, anthracene, anthrinilic acid, 
avidin, DAPI, isosulfan blue, malachite green, psoralen, 
ethyl red, 4- (psoraen-8-yloxy) -butyrate, tartaric acid, ( + )- 
a-tocopheral, psoralen, EDTA, methidium, acridine, 
Ni (II) •Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, 

20 and gold particles. Such bifunctional compounds are useful 
for dsDNA affinity capture, covalent dsDNA modification, 
oxidative dsDNA cleavage, and dsDNA photocleavage, among 
other applications. Such bifunctional compounds are also 
useful for dsDNA detection by providing a cyclic compound 

25 linked to a detectable label. Detailed instructions for 
synthesis of such bifunctional polyamides can be found in 
copending U.S. provisional patent application 60/043,444. 

An additional representative class of bifunctional 
cyclic compounds according to the invention are cyclic 

30 polyamides which bind in the minor groove of dsDNA but to 
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which is attached a group that intereacts, directly or 
indirectly, with cellular components ( e.g. , a DNA binding 
protein such as a transcription factor, a polymerase, etc . ) 
that interact with the neighboring major groove of the 
5 dsDNA. An example of such an indirect interaction includes 
altering the charge ( e.g. , neutralization) of a portion of 
the phosphate backbone of one or both strands of a dsDNA 
such that the resulting electrostatic change (s) in the 
particular DNA microenvironment is sufficient to disrupt the 

19 interaction of a cellular component that interacts the 
affected portion of the major groove. A hairpin polyamide 
that incorporated a three amino acid group neutralized the 
regional charge on the adjacent DNA backbone and disrupted 
binding of a DNA binding protein that interacted with the 

15 adjacent major groove. 

II* Cyclic Compound Design 

Cyclic compounds of the invention can be designed 
rationally, which design will preferably take into 

20 consideration at least the sequence of nucleotide base pairs 
of the target sequence. However, in certain embodiments of 
the invention, such information is not required or may not 
be available. 

Notwithstanding the foregoing, it is expected that in 
25 most situations the sequence of nucleotide base pairs of the 
target sequence will be known. With such information, the 
design of certain cyclic compounds can be undertaken. For 
example, for polyamides, a series of 
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simple pairing rules have been established for designing 
sequence- specific polyamides, which rules are described 
below. 

Preferred specific symmetrical cyclic polyamides 
according hereto are in the form of two polymer portions 
covalently linked by linkers (although cycle formation by 
direct covalent attachment of one polymer portion to another 
at two locations, as well as one such direct linkage between 
the two polymers and at least one indirect linkage via a 
linker, is also within the scope hereof), wherein the two 
polymer portions comprise the following structures: 

Polymer 1 : 

XiX 2 ...X m - Y-X ( m+1 j 

Polymer 2 : 

X(2m-l)X 2m 

wherein Xi, X 2 , X m , X( m+1)/ X (2m -i)/ and X 2m are carboxamide 
residues forming carboxamide binding pairs (i.e., hydrogen 
bond donor/donor or donor/acceptor pairs) X 1 /X 2m/ X 2 /X (2ra . 1)/ 
X m /X (m+1)/ and Y is P-alanine or another aliphatic register 
restoring moiety, and where carboxamide binding pair Xi/X 2m 
corresponds to base pair N X *N 1 lt carboxamide binding pair 
X 2 /X (2in _ 1) corresponds to base pair N 2 »N" 2/ and carboxamide 
binding pair X m /X im+1) corresponds to base pair N m *N 1 m . It is 
also understood that, for purposes of these embodiments, m 
shall be greater than or equal to 2, and that the second 
polymer may also comprise register-restoring moiety, if 
necessary. 

In general, the specific cyclic polyamides were 
designed by using a method that comprises the steps of 
identifying the target DNA sequence 5 ■ -WN!N 2 ..JST m W-3 ' ; 
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representing the identified sequence as 5 ' -Wab...xW-3 1 , 
wherein a is a first nucleotide to be bound by the X x unit, b 
is a second nucleotide to be bound by the X 2 unit, and x is 
the corresponding nucleotide to be bound by the X m unit; 
5 defining a as A, G, C, or T to correspond to the first 
nucleotide to be bound by a carboxamide residue in the 
identified six base pair sequence. 

Units for the polymers of these cyclic polyamides were 
selected sequentially as follows: Im was selected as the Xi 
10 unit and Py as the X 2m unit if a was G. Py was selected as 
the X x unit and Im a the X 2m unit if a was C. Hp was 
selected as the Xi unit and Py as the X 2m unit if a was T. 
Py was selected as the X x unit and Hp as the X 2m unit if a 
was A. 

15 The remaining units were selected in the same fashion. 

Im was selected as the X 2 unit and Py was the X 2m _i unit if b 
was G. Py was selected as the X 2 unit and Im as the X 2m -i 
unit if b was C. Hp was selected as the X 2 unit and Py as 
the X 2m _! unit if Jb was T. Py was selected as the X 2 unit and 

20 Hp as the X 2m -i unit if b was A. 

The selection of units was continued through m 
iterations. In the last iteration, Im was selected as the X m 
unit and Py was the X m+ i unit if x was G. Py was selected as 
the X m unit and Im as the X m+ i unit if x was C. Hp was 

25 selected as the X m unit and Py as the X m+ i unit if x was T. 
Py was selected as the X m unit and Hp as the X m+1 unit if x 
was A. 

In one preferred embodiment, the cyclic polyamide 
includes at least four consecutive unit pairs, wherein each 
30 unit pair comprises a carboxamide pair for binding to at 
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least four base pairs in a dsDNA . In another preferred 
embodiment, the cyclic polyamide includes at least five 
consecutive unit pairs, wherein each unit pair comprises a 
pair for binding to at least five base pairs in a dsDNA. In 
yet another preferred embodiment, the cyclic polyamide 
includes at least six consecutive unit pairs, wherein each 
unit pair comprises a carboxamide pair for binding to at 
least six base pairs in a dsDNA. In one preferred 
embodiment, the cyclic polyamides have four unit pairs, 
wherein each unit pair comprises a carboxamide binding pair, 
that will distinguish A*T, 1>A, OG, and G»C base pairs in 
the minor groove of a dsDNA. 

For preferred cyclic polyamides, DNA target sequence 
recognition thus depends on a code of unit pairings in the 
minor groove of double stranded DNA. Notwithstanding the 
foregoing, in many symmetric cyclic polyamides according to 
the invention, it will be preferred to incorporate one or 
more aliphatic, register restoring moieties as a unit in one 
or both polymer portions. This is particularly true when 
design considerations indicate that two or more Im residues 
should be placed in the same polymer portion, especially 
when Im residues are adjacent. Similarly, molecular 
modeling can be used to predict when and where to include 
such register-restoring moieties when other units are 
incorporated into such compounds. Moreover, inclusion of 
such moieties, particularly those that do not participate in 
hydrogen bond formation between the cyclic compound and a 
double -stranded nucleic acid, in cyclic compounds according 
to the invention will provide additional structural 
flexibility. Such structural flexibility will enable a 



WO 00/40605 PCTAJS00/00298 

37 

better three-dimensional "fit" between a cyclic compound and 
its target sequence, thereby enhancing either or both 
binding affinity and specificity. 

While pairing rules exist for polyamides, including 
5 cyclic polyamides, such may not be the case for other cyclic 
compounds within the scope hereof. In such situations, 
other approaches to compound design may be undertaken. Such 
approaches range from random synthesis methods to the use of 
molecular modeling. For example, molecular modeling based 

10 on new or existing high resolution X-ray crystallcgraphic 
and/or NMR data can be used to predict the spatial 
arrangement of hydrogen bond donors and acceptors in a 
target nucleotide sequence. This information can then be 
used to design a cyclic compound, or preferably a library of - 

15 cyclic compounds, which structurally produces a three 

dimensional molecular scaffold of hydrogen bond acceptors 
and donors capable of forming a sufficient number of 
hydrogen bonds with the target nucleotide sequence under 
physiological conditions to allow the desired 

20 compound/target interactions to occur. 

III. Cyclic Compound Synthesis 

Any suitable method can be employed to synthesize 
cyclic compounds according to the invention. Such methods 

25 include those performed in solution and those performed on a 
solid support. The latter method is particularly preferred 
when synthesizing relatively small quantities ( e.g. , less 
than one gram) of a plurality of cyclic compounds, as may 
occur in the production of a library of cyclic compounds for 

30 screening against one or more target sequences. Moreover, 
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solid phase methods may be adapted to machine -assisted 
protocols, thereby enabling automated or semi -automated 
methods to be carried out. 

Representative embodiments of such methods relate to 
the solid phase synthesis of cyclic polyamides comprising 
seven or more units. For purposes of this illustrative 
description, all units are amino acids. However, this need 
not be so in each of these embodiments. Such embodiments 
are derived by modification of in situ neutralization 
methods described by Kent and coworkers (Schnolzer, et al . , 
Int. J. Peptide Protein Res. . vol . 40:180-193 (1992); 
Milton, et al . , Science , vol . 252:1445-48 (1992)), and can 
be performed as described in USSN 08/607,078. 

To summarize, such methods preferably begin by 
obtaining or preparing a suitable solid support, for 
example, a polystyrene resin. Suitable supports include 
Boc-Py-PAM/BAM resins, Boc - Py-G - PAM/Boc - Py - S- PAM resins, and 
the corresponding Boc-Im resins. Preferably, the resin will 
contain a linker, and optionally a spacer, molecule that 
enables facile attachment and removal of the desired 
polyamide. If necessary for subsequent manipulations, the 
resin is then deprotected, for example, by washing it with 
dichloromethane followed by Boc group removal using 65% 
trif luoroacetic acid (TFA)/35% dichloromethane/0 . 5 M 
thiophenol for 20 min., followed by washing first with 
dichloromethane and then with dimethyl formamide (DMF) . 

A protected and activated carboxy terminal amino acid 
(wherein the amino group is protected and the carboxylic 
group is activated, e.g. , Boc-Py, Boc-Im-OBt, Fmoc-Py-Obt, 
Fmoc-Im-Obt, and particularly a Boc-protected allyl ester-Py 
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monomer) , or an intermediate amino acid polymer ( e.g. , an 

amino acid dimer or trimer) , is then reacted with the 

activated amino functionality of the resin. 

Diisopropylethylamine is preferably present during the 
5 coupling reaction. The use of high concentrations of amino 

acids is preferred to enhance coupling reaction rates. 

After approximately 45 min., the resin is washed with DMF. 

When coupling amino-protected Py to the amino functionality 

of Im, it is preferable to use a Py monomer activated as a 
10 symmetric anhydride rather than an -OBt ester, using the 

protocol described by Ding, et al . , Acta Chem. Scand. . vol . 

23:751 (1963), as modified in USSN 08/607,078. 

Assembly of the polyamide is continued by the 

sequential addition of the remaining amino acids. After 
15 coupling of the final amino acid to the nascent polyamide, 

the amino group is deprotected. When an allyl group is 

present, it can be removed by a palladium catalyst. The 

polyamide can then be released from the resin by any 

suitable chemistry, e.g. , by treatment with 
20 dimethylaminopropylamine . A cycle can then be formed by 

treatment with diphenylphophoryl azide (DPPA) in dilute DMF. 

See USSN 08/607,078 for a detailed discussion of such 

methods and variations thereon. 

Cyclic polyamides made in accordance herewith are 
25 preferably purified prior to use. RP-HPLC is a particularly 

preferred purification method. Irrespective of the 

purification method employed, purity levels exceeding about 

90%, preferably above about 95%, and even more preferably . 

above about 99%, are desirable. 



30 
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IV. Screening Methods 

Another aspect of the present invention concerns 
methods of screening compounds made in accordance herewith 
to identify one or more cyclic compounds capable of binding 
to a selected target nucleotide sequence in a double- 
stranded nucleic acid (the screening substrate) , 
particularly duplex DNA. Such methods involve combining one 
or more test compounds to be screened, or a mixture 
containing the test compound (s) , in an appropriate assay 
system. Typically, the assay system employs a double- 
stranded nucleic acid containing one or more test sequences, 
i.e. , the nucleotide sequence (s) with which one or more of 
the test compound (s) will hopefully interact, and one or 
more other substances ( e .q. , nucleic acids, polypeptides, 
small molecules, polysaccharides, etc. ) each of which 
interacts with the same or a different specific nucleotide 
sequence in duplex nucleic acid molecules, i.e. . the control 
sequence, which control sequence (s) are also present in the 
assay mixture. Moreover, as those in the art will 
appreciate, the test and control sequences can be the same, 
partially the same ( e.g. , overlapping by one or more base 
pairs), or different. When different, they may be separated 
by one or more base pairs, but are preferably juxtaposed, 
i -e. . not overlapping and not separated by any nucleotide 
base pairs. Indeed, the test and control sequences can be 
present on different screening substrates. 

Suitable test systems typically include a duplex 
nucleic acid molecule (preferably dsDNA) containing at least 
one test sequence, and preferably at least one control 
sequence. In those instances when a screening substrate 
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comprises only a test sequence, and does not contain a 
screening sequence, or vice versa, the assay will also 
contain a second screening substrate containing the test or 
control sequence not found on the first screening substrate. 
5 The screening substrate (s) may contain one or more copies of 
the test and/or screening sequences. 

The test sequence (s) can be obtained in a number of 
ways. For example, such sequences can be randomly generated 
(for example, by automated oligonucleotide synthesis) for a 

10 shotgun approach. Alternatively, one or more specific 
sequences ( e.g. , a specific promoter implicated in the 
manifestation of a particular disease) may be incorporated 
into the screening substrate. 

Preferred control sequences for use in such assays 

15 include regulatory regions of disease-associated genes 

( e.cr. , genes implicated in cancer or autoimmune disease) and 
of genes of pathogens ( e.g. , viruses and eukaryotic and 
prokaryotic pathogens) . In such instances, the binding 
substances employed will preferably be the proteins (or DNA 

20 binding domains thereof) known or suspected to interact with 
such sequences in vivo . Alternatively, the control 
sequence/binding substance used may be part of a well 
characterized protein/nucleic acid system, for example, the 
lac operator and repressor of E. coli or the o L -o R /cro 

25 repressor of phage lambda. Indeed, essentially any well 
characterized duplex nucleic acid/corresponding binding 
substance interaction can be readily adapted to the instant 
methods . 

In preferred embodiments, the duplex nucleic acid- 
30 binding substance used in the assay system comprises a DNA 



WO 00/40605 



PCT/US00/00298 



42 



binding protein (or portion thereof that interacts with 
dsDNA) that binds with high affinity (preferably with 
subnanomolar affinity) to a control sequence, e.g. . the 
polypeptide's cognate binding site, in a manner that is 
preferably substantially independent of the sequences 
overlapping with or adjacent or juxtaposed to the control 
sequence. However, the polypeptide (or other duplex nucleic 
acid-binding substance) should be sensitive to high affinity 
binding (which, with respect to this aspect of the 
invention, means a substance with a K a of less than about 100 
nM, preferably less than about 10 nM, particularly less than 
about 1 nM) of a compound to a test sequence, especially 
when the test sequence is the same as, overlapping with, or 
juxtaposed to (including being located on a separate 
screening substrate molecule) the control sequence. 

To function properly, the duplex nucleic acid binding 
substance should be present in an amount that saturates the 
control sequence (s) . For each assay, the test molecule is 
incubated in an assay mixture also containing the screening 
substrate (s) and binding substances for a period of time 
sufficient to permit equilibrium binding to be established 
between the test molecule (s), binding substance (s) , and test 
and control sequences. The conditions under which such 
assays are performed are preferably in vitro conditions, 
although the instant methods can readily be adapted for 
application in cultured cells. Indeed, in preferred 
methods, test compounds identified in an in vitro assay as 
having the desired activity ( e.g. , subnanomolar binding 
affinity and the ability to substantially prevent, inhibit, 
or otherwise disrupt the interaction of a binding substance 
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with its particular binding site on a screening substrate) 
are then subjected to screening in cultured cells to confirm 
their inhibitory activity on binding substance/duplex 
nucleic acid interactions. Those compounds exhibiting the 
5 desired activity profile are then typically for further 
studies, including those conducted in vivo . 

Typically, in performing such screening assays the 
amount of binding substance bound to the control sequence of 
the screening substrate is compared before and after the 

10 addition of the test molecule or mixture. In preferred 

embodiments, comparison of a binding substance -bound to free 
dsDNA can be accomplished using a gel band-shift assay, 
filter-binding assay, or a capture/detection assay. 
Alternatively, indirect detection can be also be performed, 

15 for example, by comparing the activity of a restriction 
enzyme's cleavage of its cognate restriction site in the 
presence or absence of the test compound, or by examining 
the level of expression of a reporter gene (as may be 
detected at the nucleic acid or protein level, or by 

20 detection of some other signal, e.g. , the amount of light 
generated by an assay system in which the gene, the 
expression of which is to be modulated, is lucif erase. 

Thus, screening methods according to the invention 
. include those in which binding substance/duplex nucleic acid 

25 interactions are directly assayed ( e.g. , a gel shift assay) , 
as well as those involving more indirect methods, e.g. , 
detection of reporter gene expression. The methods of the 
invention may also be carried out on one or few test 
compounds or, alternatively, be performed on a high 

30 throughput basis, wherein 100-10,000 or more test compounds 
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are screened against one or more test sequences. Such high 
throughput methods are particularly useful in conjunction 
with the solid state cyclic polyamide synthesis methods 
described herein. 

V. Compositions 

The compounds described herein can be administered to 
cells in vitro or in vivo . in addition, compounds according 
to the invention can also be used in cell-free environments, 
for example, in experiments wherein cell extracts containing 
a fraction which contains double -stranded nucleic acid 
(g -g- / a fraction containing nuclei, nuclear DNA, 
mitochondria, or miotochondrial DNA) or in experiments using 
dsDNA (e.g. , duplexed oligonucleotides substantially 
complementary over at least a portion of their lengths, 
restriction fragments, plasmids, etc. ) . Thus, the intended 
application for a cyclic compound according to the invention 
will typically dictate the ultimate product composition, as 
those in the art will appreciate. 

For example, wherein a cyclic compound according to the 
invention ( e.g. , a cyclic polyamide) is to be used 
therapeutically (e.g. , as an animal, particularly a human, 
medicine) , it is formulated into an appropriate therapeutic 
composition. Similarly, if the compound is to be used in a 
diagnostic application, it will be incorporated into an 
appropriate composition for such application. 

In its most simple form, a composition according to the 
invention may simply comprise a cyclic compound according to 
the invention, preferably as a salt. More typically, 
however, a composition according to the invention will 
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comprise a cyclic compound and one or more other compounds. 
Such other compounds, or excipients, may or may not have 
independent or activity. 

The term "pharmaceutically acceptable" or 
5 "pharmaceutical" as used herein refers to solutions or 
components of the pharmaceutical composition that do not 
prevent the therapeutic compound from exerting a therapeutic 
effect and do not cause unacceptable adverse side effects. 
For purposes of brevity, when used herein "pharmaceutical" 

10 shall be understood to include both human and other animal 

( e.g. , domestic and other animal species, particularly those 
of agricultural importance) application. Examples of 
pharmaceutically acceptable reagents are provided in The 
United States Pharmacopeia. The National Formulary, United 

15 States Pharmacopeial Convention, Inc., Rockville, MD 1990 
and FDA Inactive Ingredient Guide 1990, 1996 issued by the 
Division of Drug Information Resources (both are hereby 
incorporated by reference herein, including any drawings) . 
Unacceptable side effects vary for different diseases or 
20 conditions to be treated or prevented. Generally, the 
severity of the disorder being treated will dictate the 
severity of the toxic effects that will be tolerated. 
Unacceptable side effects for different diseases are known 
in the art . 

25 The term "physiologically acceptable" defines a 

carrier, diluent, or excipient that does not cause 
significant irritation to an organism and preferably does 
not abrogate the biological activity and properties of the 
compound or other active ingredients, if any. 
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The term "carrier" defines a chemical compound that 
facilitates the incorporation of a compound into cells or 
tissues. For example dimethyl sulfoxide (DMSO) is a 
commonly utilized carrier as it facilitates the uptake of 
many organic compounds into the cells or tissues of an 
organism. 

The term "diluent" defines chemical compounds diluted 
in water (or another solvent) that will dissolve the 
compound (s) of interest as well as stabilize the 
biologically active form of the compound (s) and other active 
ingredient (s) , if present. Many salts dissolved in buffered 
solutions are utilized as diluents in the art. One commonly 
used buffered solution is phosphate buffered saline because 
it mimics the salt conditions of human blood. Because 
buffer salts can control the pH of a solution at low 
concentrations, a diluent rarely modifies the biological 
activity of a compound having biological activity, e.g. , a 
cyclic compound according to the invention. 

The term "solvent" as used herein refers to a chemical 
that facilitates solubilization of a compounds according to 
the invention. Examples of solvents include, but are not 
limited to, pharmaceutically acceptable alcohols, such as 
ethanol, benzyl alcohol, and glycerol; polyoxyhydrocarbyl 
compounds ( e.g. . a water soluble carbohydrate such as 
glucose, sucrose, maltotriose, and the like; a water soluble 
carbohydrate derivative such as gluconic acid and mannitol, 
oligosaccharides; and water soluble polymers such as 
polyvinylpyrrolidone, polyvinyl alcohol, polyethers such as 
polyoxyalkylenes, including polyethylene glycol (PEG) and 
derivatives thereof, or other water soluble mixed 
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oxyalkylene polymers and the polymeric form of ethylene 
glycol), pharmaceutically acceptable surfactants, and 
pharmaceutically acceptable oils. 

The term "pharmaceutically acceptable surfactant" as 
5 used herein refers to a compound that can solubilize 
compounds of the invention into aqueous solutions, if 
necessary. Preferably for parenteral formulations, the 
surfactant is a non-ionic surfactant. A "pharmaceutically 
acceptable oil" is an oil such as mineral oil or vegetable 
10 oil and the like. 

A. Formulation 

For example, when a cyclic compound according to the 
invention is to be used to treat a human disease or 

15 disorder, it will typically be human patient per se, or in 
pharmaceutical compositions where they are mixed with other 
active ingredients, as in combination therapy, and/or one or 
more suitable carriers, excipient(s) ( i.e. , a more or less 
inert substance added to a composition to confer a suitable 

20 consistency or to enable a drug to be formed) , adjuvants, 

stabilizers, and vehicles. The composition may be in solid, 
liquid, gel, or aerosol form. Techniques for formulation of 
the cyclic compounds of the instant application may be found 
in "Remington's Pharmaceutical Sciences," Mack Publishing 

25 Co., Easton, PA (1995) 

The pharmaceutical compositions of the present 
invention may be manufactured in a manner that is itself 
known, e.g. , by means of conventional mixing, dissolving, 
granulating, dragee -making, levigating, emulsifying, 
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encapsulating, entrapping, spray drying, or lyophilizing 
processes . 

Proper formulation is dependent upon the route of 
administration chosen. For injection, the agents of the 
invention may be formulated in aqueous solutions, preferably 
in physiologically compatible buffers such as Hank's 
solution, Ringer's solution, or physiological saline buffer. 
For transmucosal administration, penetrants appropriate to 
the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

For oral administration, the compounds can be 
formulated readily by combining the active compound (s) with 
pharmaceutical^ acceptable carriers, excipients, etc. well 
known in the art. Such carriers excipients, etc . enable the 
compounds of the invention to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like. Pharmaceutical preparations for 
oral use can be obtained by adding a solid excipient, 
optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable 
excipients are, in particular, fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat 
starch, rice starch, potato starch, gelatin, gum tragacanth, 
methyl cellulose, hydroxypropylmethyl- cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP) . 
If desired, disintegrating agents may be added, such as the 
cross -linked polyvinyl pyrrolidone, agar, or alginic acid or 
a salt thereof such as sodium alginate. In certain non- 
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human applications, such compositions may be included in 
food preparations. 

Dragee cores are provided with suitable coatings. For 
this purpose, concentrated sugar solutions may be used, 
5 which may optionally contain gum arabic, talc, polyvinyl 
pyrrolidone, carbopol gel, polyethylene glycol, and/or 
titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. Dyestuffs or pigments may be 
added to the tablets or dragee coatings for identification 
10 or to characterize different combinations of active compound 
doses . 

Pharmaceutical preparations that can be used orally 
include push-fit capsules made of gelatin, as well as soft, 
sealed capsules made of gelatin and a plasticizer, such as 

15 glycerol or sorbitol. The push-fit capsules can contain the 
active ingredients in admixture with filler such as lactose, 
binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended 

20 in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers may 
be added. All formulations for oral administration should 
be in dosages suitable for such administration. 

For buccal administration, the compositions may take 
25 the form of tablets or lozenges formulated in conventional 
manner . 

For administration by inhalation, the cyclic compounds 
for use according to the present invention are conveniently' 
delivered in the form of powder, or more preferably, as an 
30 aerosol spray from a pressurized pack, nebulizer, or metered 
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dose inhaler, with the use of a suitable propellant, e.g. , 
dichlorodif luorome thane, trichlorof luoromethane, 
dichlorotetraf luoroethane, carbon dioxide, or other suitable 
gas . 

5 Cyclic compounds can also be formulated for parenteral 

administration by injection, e.g. , by bolus injection or 
continuous infusion. Formulations for injection may be 
presented in unit dosage form, e.g. , in ampules or in multi- 
dose containers, with an added preservative. The 

10 compositions may take such forms as suspensions, solutions, 
emulsions in oily or aqueous vehicles, and lyophilized 
powders for reconstitution in an appropriate solution ( e.g. , 
sterile pyrogen- free water) prior to use, and may contain 
formulatory agents such as suspending, stabilizing, and/or 

15 dispersing agents. 

In particular, pharmaceutical formulations for 
parenteral administration include aqueous solutions of the 
active compounds in water-soluble form. Additionally, 
suspensions of the active compounds may be prepared as 

20 appropriate oily injection suspensions. Aqueous injection 
suspensions may contain substances that increase the 
viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension 
may also contain suitable stabilizers or agents that 

25 increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

In addition to the formulations described previously, 
the cyclic compounds of the invention may also be formulated 
as a depot preparation. Such long acting formulations may 

30 be administered by implantation (for example, subcutaneously 
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or intramuscularly) or by intramuscular injection. Thus, 
for example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as 
5 sparingly soluble derivatives, for example, as a sparingly 
soluble salt . 

Other delivery systems for the pharmaceutical 
compositions compounds may also be employed. For example, 
liposomes and emulsions are well known examples of delivery 

10 vehicles or carriers for hydrophobic drugs. Use of delivery 
vehicles such as liposomes affords the opportunity to 
provide one or more targeting moieties in the composition, 
thereby enabling the cyclic compound to be delivered to a 
particular cell or tissue. Cell and tissue targeting 

15 techniques and moieties are known in the art, and will vary 
depending upon the particular application. 

Additionally, cyclic compounds may be delivered using a 
sustained-release system, such as semi -permeable matrices of 
solid hydrophobic polymers containing the therapeutic agent. 

20 Various types of sustained-release materials have been 

established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical 
nature, release the compounds for a few weeks up to over 100 
days. Depending on the chemical nature and the biological 

25 stability of the cyclic compound, additional strategies for 
protein stabilization may be employed. 

For non-animal application, for example, to plants, the 
compositions of the invention will typically comprise the 
cyclic compound in a solution which can be sprayed on the 

30 plants or included in irrigation water. Alternatively, 
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solid dosage forms that can be broadcast upon or near the 
targeted plant (or seeds) can also be prepared in accordance 
with methods known in the art. 

Many of the cyclic compounds of the invention may be 

5 provided as salts with pharmaceutical^ compatible 

counterions. Pharmaceutically compatible salts may be 
formed with many acids, including, but not limited to, 
hydrochloric, sulfuric, acetic, lactic, tartaric, malic, 
succinic, etc . acid. Salts tend to be more soluble in 

0 aqueous or other protonic solvents than are the 
corresponding free base forms. 

B. Administration 

In a therapeutic or prophylactic context, the term 
5 "administering" relates to a method of incorporating or 

delivering a cyclic compound of the invention into cells or 
tissues of an organism, be it an animal or a plant. As 
those in the art will appreciate, depending upon 
application, administration can be either in vivo ( i.e. , to 
a living animal or plant) or in vitro ( e.g. , to cultured 
cells) . With regard to in vivo administration to an animal 
( e.g. , a human), many administration techniques exist, 
including, but not limited to, oral, parenteral {e.g. 
intravenous , intramuscular , subcutaneous , intraperitoneal , 
and intraarticular injection) , and aerosol administration. 
The compounds may also be administered in a depot or 
sustained release formulation. For in vitro (including ex 
vivo ) therapy, multiple administration techniques also 
exist, including cell microinjection techniques, simple 
diffusion., and carrier techniques. 
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To affect gene expression in a cell, which may include 
causing an increase or a decrease in the expression of one 
or more genes, an effective quantity of one or more cyclic 
compounds polyamide is delivered to and internalized by the 
5 cell, be it in vivo or in vitro . Effective extracellular 
concentrations of polyamides that can modulate gene 
expression range from about 0.01 nanomolar (nM) to about 1 
micromolar ()iM) , more preferably from about 0.1 nM to about 
0.1 j.tM, particularly from about 1 nM to about 0.05 ]xM f 

10 especially about 10 nM to about 0.75 jaM. Gottesfeld, et 
al . , Nature , vol. 387:202-205 (1997). 

Notwithstanding the foregoing, those in the art will 
appreciate that, with regard to therapeutic or prophylactic 
treatment, the selection of the precise concentration, 

15 composition, and delivery regimen will be influenced by, 

inter alia, the specific pharmacological properties of the 
particular composition, the intended use, the nature and 
severity of the condition being treated or prevented, the 
age, weight, gender, and physical condition of the intended 

20 recipient, as well as the route of administration. Such 
considerations are within the purview of the skilled 
artisan. For example, to determine effective amounts and 
concentrations of cyclic polyamides in vitro , a suitable 
number of cells is plated on tissue culture plates and 

25 various quantities of one or more cyclic polyamides are 
added to separate wells. Expression of the particular 
gene(s) whose expression is (are) to be modulated following 
exposure to the compound can be monitored in the cells or 
medium by detecting the amount of the protein gene product 

30 present as determined by various techniques utilizing 
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specific antibodies, including ELISA and Western blotting. 
Alternatively, gene expression following exposure to a 
polyamide can be monitored by detecting the amount of 
messenger RNA present, as determined by various techniques, 
including Northern blotting and RT-PCR. 

Similarly, to determine effective amounts and 
concentrations of cyclic compounds for in vivo 
administration, a sample of body tissue or fluid, such as 
plasma, blood, urine, cerebrospinal fluid, saliva, or biopsy 
of skin, muscle, liver, brain or other appropriate tissue 
source, can be analyzed. Gene expression following exposure 
to a polyamide can be monitored by detecting the amount of 
the protein gene product present as determined by various 
techniques utilizing specific antibodies, including ELISA 
and Western blotting. Alternatively, gene expression 
following exposure to the compound can be monitored by the 
detecting the amount of messenger RNA present as determined 
by various techniques, including northern blot and RT-PCR. 
Such determinations can also be made by these and others 
techniques applied to cells from an organism of the type to 
be treated (or an animal model therefore) before and after 
administration of a composition according to the invention. 
These and other strategies known in the art can be readily 
adapted to a given application for a particular cyclic 
compound . 

VI. Applications 

A. Modulation of Gene Expression 

The cyclic compounds described herein are useful for 
modulating expression of one or more genes in vitro or in 
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vivo . Such modulation is typically accomplished by 
delivering a quantity of a cyclic compound according to the 
invention sufficient to interact with a specific nucleotide 
sequence present in double -stranded nucleic acid ( e.g. , 
5 dsDNA) , preferably a regulatory sequence, and cause a change 
in expression of a gene functionally associated with such 
regulatory sequence. Such sequences include promoters, 
enhancers, repressor binding sites, and any other nucleotide 
sequence targeted by a DNA binding protein or other 

10 substance in a cell that can affect transcription. Cellular 
events involved in transcription include nucleosome 
formation and dissociation, formation of RNA transcription 
complexes, transcription initiation, etc . 

Genes the expression of which can be modulated in 

15 accordance with this invention include cellular or viral 
genes. Such cellular genes include both eukaryotic and 
prokaryotic genes. The cellular gene(s) can be present in 
original, native cells, in cells transfected or transformed 
with a recombinant DNA construct comprising the cellular 

20 gene, or in an in vitro , cell-free system, for example, as a 
reporter gene. Similarly, a viral gene can be present in a 
cell or in an in vitro , cell-free system. 

The polyamides of the present invention can act as 
specific inhibitors or activators of gene transcription in 

25 vivo or in vitro as therapeutic or prophylactic agents in 

treating disease conditions related to the transcription of 
at least one cellular or viral gene. 

Certain embodiments of the present invention concern 
the use of a unique or rare sequence adjacent to or 

30 overlapping with the binding sites for common transcription 
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factors as the target sequences for the design of cyclic 
compounds. It has been found that sequences adjacent to the 
binding sites for required transcription factors are unique, 
i »e. , are not associated with genes in the current publicly 
5 available nucleic acid databases. 

Many protein coding genes utilize both gene- and 
tissue-specific transcription factors as well as general 
transcription factors for transcription of mRNA by RNA 
polymerase II. The binding sites for these protein factors 

10 are found in numerous genes, whereas the sequences adjacent 
to or overlapping with these binding sites tend to be unique 
for each gene. Cyclic compounds can be designed which 
target sequences adjacent to or overlapping with the binding 
sites for these transcription factors, as well as to the 

15 binding sequences for these factors. Cyclic compounds that 
target these sequences will interfere with the binding of 
the protein factors to DNA and thereby inhibit transcription 
by RNA Polymerase II. 

In other embodiments, cyclic compounds can be designed 

20 and synthesized that recognize and bind the sequences 

immediately adjacent to, or overlapping with, the site at 
which the minor groove -binding protein TATA-box binding 
protein (TBB) binds to TATA DNA can be designed. DNA 
sequences adjacent to or overlapping with TATA elements are 

25 gene-specific, whereas TATA elements are found in many 
protein-coding genes. For example, a hairpin polyamide 
bound to a sequence adjacent to the HIV-1 TATA element has 
been shown to inhibit HIV-1 promoter-specific transcription 
by RNA polymerase II. A cyclic compound designed to 
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selectively bind this site would be useful to treating 
diseases associated with HIV-1 infection. 

In other embodiments, the cyclic compounds will 
recognize and bind to an identified target sequence adjacent 
5 to the transcription factor protein binding site of a 

cellular gene, for example, a const i tut ively expressed gene 
under basal transcription control ( e.g. , the gene encoding 
the 5S ribosomal subunit) . 

In yet other preferred embodiments, the minor groove 

10 transcription factor protein of the cellular gene is TBP . 

Such preferred cellular genes include oncogenes such as LEF- 
1, Ets-1 and her-2/neu. Other such preferred cellular genes 
include genes encoding cytokines such as interleukins , 
including IL-2, IL-5 and IL-13, tumor necrosis factors, 

15 including TNF-alpha and TNF-beta, growth factors, including 
TGF-beta, and colony stimulating factors, including GM-CSF. 

Using the above described rules, a sequence- specif ic 
cyclic compound can be designed that selectively binds to an 
identified target site adjacent to the binding site of a DNA 

20 binding protein, e.g. , transcription factor. As used 

herein, "adjacent" includes cyclic compound binding sites 
wherein an end nucleotide base pair of the binding site is 
immediately contiguous to an end nucleotide of the DNA 
binding protein binding site, and cyclic compound binding 

25 sites separated from the protein binding site by from one to 
about 20, preferably from 1 to about 10 or fewer intervening 
nucleotide base pairs. The binding affinity of such a 
designed cyclic compound should be greater than the binding 
affinity of the native transcriptional element in order to 

30 inhibit transcription. The binding affinity can be 
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ascertained by competitive inhibition experiments against a 
native transcription factor. "Overlapping" refers to a 
cyclic compound binding site wherein one to five nucleotide 
base pairs of the compound's binding site are shared with 
the binding site of the DNA binding protein. 

A "promoter" is a regulatory sequence of dsDNA involved 
in the binding of RNA polymerase to initiate transcription 
of a gene. A "gene" is a segment of DNA which codes for a 
"gene product", typically a peptide, polypeptide, or 
protein, including the coding region, non- coding regions 
preceding {"leader") and following ("trailer") the coding 
region, as well as intervening non-coding sequences 
("introns") between individual coding segments ("exons"). 
Coding refers to the representation of amino acids, start 
and stop signals in a three base "triplet" code, and other 
signals, such as polyadenylation signals. "Gene products" 
also include RNA molecules, for example, tRNAs , small 
nuclear RNAs, ribosomal RNAs, and catalytic RNAs ( e.g. , 
ribozymes) . A "gene of interest" refers to a gene the 
expression of which is desired to be modulated using a 
cyclic compound according to the invention. 

Promoters are often upstream (" * 5 to") the 
transcription initiation site of the corresponding gene. 
Other regulatory sequences of DNA in addition to promoters 
are known, including sequences involved with the binding of 
transcription factors, including response elements that are 
the DNA sequences bound by inducible factors. "Enhancers" 
comprise yet another group of regulatory sequences of DNA 
that can increase the utilization of promoters, and can 
function in either orientation (5'-»3' or 3'->5') and in any 
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location (upstream or downstream) relative to the promoter. 
In some embodiments, the regulatory sequence has a positive 
activity, i.e. , binding of an ehdogeneous ligand ( e.g. , a 
transcription factor) to the regulatory sequence increases 
5 transcription, thereby resulting in increased expression of 
the corresponding target gene. In such a case, interference 
with transcription by binding a polyamide to a regulatory 
sequence would reduce or abolish expression of a gene. 
A promoter can also include or be adjacent to a 

10 regulatory sequence known in the art as a "silencer" . A 
silencer generally has a negative regulatory effect on 
expression of the gene. In such cases, expression of a gene 
may be increased (also referred to as gene "activation" or 
"derepression" ) directly by using a cyclic compound 

15 according to the invention, e.g. , a polyamide, to prevent 
binding of a factor to a silencer regulatory sequence, or 
indirectly by using such a compound to block transcription 
of a factor that interacts with a silencer. 

To affect gene expression in a cell, which may include 

20 causing an increase or a decrease in gene expression, a 
quantity of one or more cyclic compounds effective to 
modulate transcription is contacted with the cell and 
internalized by the cell. The cell may be contacted by the 
polyamide in vivo or in vitro . Effective transcription 

25 inhibiting extracellular concentrations of polyamides that 
can module gene expression range from about 10 nanomolar to 
about 1 micromolar. Gottesfeld, J.M., et al . , Nature 
387:202-205 (1997). To determine effective amounts and 
concentrations of polyamides in vitro , a suitable number of 

30 cells is plated on tissue culture plates and various 
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quantities of one or more polyamides are added to separate 
wells. Gene expression following exposure to a cyclic 
compound can be monitored in the cells or in the medium by 
detecting the amount of the protein gene product present as 
5 determined by various techniques utilizing specific 
antibodies, including ELISA and Western blot. 
Alternatively, gene expression following exposure to a 
cyclic compound can be monitored by detecting the amount of 
messenger RNA present as determined by various techniques, 

10 including Northern blot and RT-PCR. 

Similarly, to determine effective amounts and 
concentrations of polyamides for in vivo administration, a 
sample of body tissue or fluid, such as plasma, blood, 
urine, cerebrospinal fluid, saliva, or biopsy of skin, 

15 muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a cyclic 
compound can be monitored by detecting the amount of the 
protein gene product present as determined by various 
techniques utilizing specific antibodies, including ELISA 

20 and Western blot. Alternatively, gene expression following 
exposure to a cyclic compound can be monitored by detecting 
the amount of messenger RNA present as determined by various 
techniques, including Northern blot and RT-PCR. 

25 1 . Inhibition 

The present invention provides cyclic compounds, 
including cyclic polyamides, which inhibit the transcription 
of DNA upstream or downstream of transcriptional factors 
such as the 5S RNA gene transcriptional factor TFIIIA, the 
30 minor groove -binding protein TATA-box binding protein (TBP) , 



WO 00/40605 PCT/US00/00298 

61 

Ets-l, and LEF-1 . Such compounds can act as gene-specific 
inhibitors of transcription since they are selective for the 
sequences flanking these protein binding sites that are, in 
turn, gene-specific . 

5 

2 . Activation 

In addition to being useful to inhibit gene 
transcription, the cyclic compounds according to the 
invention can also be used to activate transcription of one 

10 or more genes. Such activation will typically involve 

interference with, or inhibition of, binding of a repressor 
protein or a similar molecule to a specific regulatory 
region of the gene of interest. Accordingly, the methods 
used to design and synthesize cyclic compounds, particularly 

15 cyclic polyamides, to inhibit gene transcription can 

similarly be applied in the context of activation of a gene 
of interest . 

B. Treatment/Prophylaxis 

20 As will be clear to those in the art, the cyclic 

compounds described herein can be used to treat or prevent a 
variety of diseases and other conditions associated with the 
expression of one or more genes, and will thus have many 
applications, including anti -viral, anti-bacterial, anti- 

25 fungal, and anti-cancer applications. 

In the treatment context, an appropriate amount of a 
cyclic compound according to the invention will be 
administered to an organism known to have, or suspected of 
having, a particular disease or disorder. In addition, 

30 cyclic compounds can be used prophylactically to prevent the 
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development of an unwanted disease or disorder by preventing 
undesirable gene expression. In either context, the cyclic 
compounds will be administered in accordance with the 
teachings provided herein. 
5 A representative example of how cyclic compounds 

according to the invention can be used in a therapeutic 
context concerns HIV. A recent review summarizes current 
knowledge of the protein factors required for the control of 
RNA initiation and elongation by RNA polymerase II at the 

10 HIV-1 promoter (Jones, K.A. and B.M. Peterlin. 1994. 
Control of RNA initiation and elongation at the HIV-1 
promoter. Annu. Rev. Biochem. , 63:717-743). For HIV, the 
template for synthesis of both new viral RNA and messenger 
RNA (for viral protein synthesis) is the integrated 

15 provirus. HIV-1 utilizes the transcription machinery of the 
host cell but encodes its own trans-activators, Tat and Rev, 
that are responsible for RNA elongation and utilization. 

The HIV-1 promoter is located in the U3 region of the 
leftward (5') long terminal repeat. The core promoter and 

20 enhancer elements span a region of approximately 250 base 

pairs and include TTA and initiator elements ad the binding 
sites for the following cellular transcription factors: 
Spl, NF-kB, LEF-1, Ets-1 and USF. Sequences upstream of the 
NF-kB sites contribute only marginally to HIV-1 promoter 

25 activity enter in vitro or in transfected cell lymphoid cell 
lines. Waterman, M.L. and K.A. Jones, New Biologist, 2:621- 
636 (1990) . However, these upstream sequences, and 
presumably the protein factors which bind these upstream 
sequences, are important for viral replication, and hence 

30 promoter activity, in peripheral blood lymphocytes and in 
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some T cell lines. Kim, J., et al . , J. Virol, 67:1658-1662 

(1993) . 

Two of the binding sites in the upstream region 
correspond to recognition sites for activator proteins that 
5 are lymphoid cell specific (or highly enriched in T cells) 
and are shared with the promoter of the T cell receptor 
(TCRa) gene: these are the Ets-1 and LEF-1 transcription 
factors . The essential role of the upstream region has been 
reproduced in vitro (Sheridan, P.L., et al . , Genes Dev., 

iO 9:2090-2104 (1995)). LEF-1 and Ets-1 are believed to act in 
concert to prevent nucleosome-mediated repression in vivo . 
Inhibition of formation of this complex may represent a 
viable target for HIV-1 therapy. LEF-1 belongs to the HMG 
family of proteins and binds DNA as a monomer in the minor 

15 groove, resulting in a large distortion of the DNA helix 
(unwinding and bending) (Love, J.J., et al . , Nature, 
376:791-795 (1995) ) . 

In addition, LEF-1 possesses a strong trans-activation 
domain that retains its function when engineering into other 

20 DNA-binding proteins (Giese, K. , et aJ w Genes Dev., 9:995- 
1008 (1995)). 

The HIV-1 promoter also contains tandem binding sites 
for NF-kB, a factor that is strongly induced by HIV 
infection (Bachelerie, F., et al., Nature, 350:709-712 
25 (1991)), and multiple binding sites for the general 

transcription factor Spl. NF-kB contacts both Spl and the 
TBP subunit of the basal transcription factor TFIID. 
Perkins, N.D., et al . , Mol . Cell. Biol., 14:6570-6583 

(1994) . Additionally, Spl has been shown to interact with 
30 the TAF110 subunit of TFIID (HOkDa TBP-associated factor) 
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(Chen, J.L., etal., Cell, 79:93-105, 1994). The binding of 
TFIID via the TBP interaction with the TATA element 
nucleates the assembly of the complete RNA polymerase II 
transcription complex (reviewed in Maldonado, E. and D. 
Reinberg, Current Opinion in Cell Biology, 7:352-361, 1995). 
Thus, NF-kB may function through recruitment of Spl and 
TFIID to the HIV-1 promoter via these protein-protein 
interactions. Accordingly, blocking the NF-kB-DNA and/or 
Spl-DNA interaction is another target for HIV therapy. 

Since these factors, and especially Spl and TFIID, are 
utilized for the expression of a wide range of cellular 
genes, the binding sites for these factors are not ideal 
targets for HIV-specific inhibition (or any gene-specific 
inhibition) . However, the sequences adjacent to these sites 
that are unique to HIV-l proviral DNA are excellent 
candidate targets for the design of inhibitory cyclic 
compounds . 

Organisms suitable for administration of cyclic 
compounds according to the invention include any plant or 
animal containing, or suspected to contain, a gene the 
expression of which would be desirable to modulate. 
Preferred animals include mammalian, fish, and avian 
species. Particularly preferred mammals include humans and 
bovine, canine, equine, feline, ovine, and porcine animals. 
Particularly preferred avian and fish species are those of 
commercial or ecological significance. Preferred plants 
include commercially or ecologically significant trees, 
grains, grasses, and cereals. 
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C. Diagnostics 

Cyclic compounds according to the present invention are 
also useful for detecting the presence of double- stranded 
nucleic acid, particularly dsDNA, containing a specific 
5 sequence of nucleotide base pairs for diagnostic or 

preparative purposes. For example, a sample containing 
dsDNA can be contacted by a sequence-specific cyclic 
compound linked to a solid substrate, thus enabling 
isolation of dsDNA comprising the desired sequence. 

10 Alternatively, cyclic compounds linked to a suitable 

detectable marker, such as biotin, a hapten, a radioisotope, 
or a dye molecule, can be contacted by a sample containing 
dsDNA suspected to contain the desired target nucleotide 
base pair sequence. Such bifunctional compounds complexed 

15 to dsDNA can then be detected using an appropriate detection 
system known to those skilled in the art. For example, DNA 
associated with a cyclic polyamide linked to biotin can be 
detected by a streptavidin/alkaline phosphatase system. 
Other diagnostic applications the cyclic compounds 

20 according to the invention include double -stranded nucleic 
acid, particularly dsDNA, sequencing on a solid support 
comprising a plurality of cyclic compounds specific for 
different nucleotide base pair sequences. 

The present invention also provides diagnostic systems, 

25 preferably in kit form, comprising cyclic compounds 

according to the invention. Representative embodiments of 
such systems include kits for assaying for the presence in a 
cell or tissue sample of a dsDNA sequence bound by a 
particular cyclic compound according to the invention. Such 

30 systems include, in an amount sufficient to perform at least 
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one assay, one or more specific cyclic compounds ( e.g. , a 
cyclic polyamide) according to the invention as a separately 
packaged reagent. Instructions for use of the packaged 
reagent (s) are also typically included. As used herein, the 
5 term "package" refers to a solid matrix or material such as 
glass, plastic ( e.g. , polyethylene, polypropylene, or 
polycarbonate), paper, foil, and the like capable of holding 
within fixed limits such cyclic compound (s) . Thus, for 
example, a package can be a glass vial used to contain 

10 milligram quantities of a contemplated cyclic compound, a 
microliter plate well to which microgram quantities of a 
contemplated cyclic compound have been operatively affixed, 
i s e . , linked so as to be capable of being bound by the 
target sequence in dsDNA, or a glass slide to which a 

15 plurality ( i.e. , more than one, and to 100,00 or more) of 

such cyclic compounds are operatively affixed, as may be thei 
case in certain nucleic acid sequencing methods using the 
instant cyclic compounds. When necessary, such diagnostic 
systems preferably also include a detectable label or other 

20 indicator capable of signaling the binding of the 

contemplated cyclic compound to its target sequence. As 
noted above, numerous detectable labels, such as biotin, and 
other indicators, such as enzyme -linked (direct or indirect) 
streptavidin, are well known in the art. 

25 

D. Research Reagents 

The cyclic compounds described herein will also find 
utility as research reagents. For example, administration 
of such compounds to cultured cells will enable the 
30 production of cells having desired phenotypes . Such 
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phenotypes may be useful for a variety of purposes, 
including the study of the effects of other compounds on the 
expression levels of one or more genes. Alternatively, 
cyclic compounds according to the invention can also be used 
5 to produce "knock-out" mutations wherein actual changes at 
the genetic level are unnecessary. 

Cyclic compounds according to the invention will also 
find use in in vitro research applications. For example, 
addition of a cyclic compound that inhibits binding of a 

10 restriction enzyme to its cognate restriction site will 
prevent cleavage of a particular DNA containing such 
restriction site. In another example, in vitro 
transcription systems can be regulated by the addition of 
one or more appropriate cyclic compounds, particularly 

15 cyclic polyamides. 

EXAMPLES 

The following examples below are non- limiting and are 
offered merely for purposes of illustrating the practice of 

20 certain preferred embodiments of the invention herein 

described, specifically the design, synthesis, and testing 
of two representative 8 ring cyclic polyamide compounds. 
Illustration of the techniques described in the examples and 
in the detailed description above may be found in any of 

25 several well-known references, such as: Sambrook, J., et 
al . , Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press (1989); Goeddel, D., ed. , Gene 
Expression Technology, Methods in Enzymology, 185, Academic 
Press, San Diego, CA (1991) ; "Guide to Protein Purification" 
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in Deutshcer, M.P., ed., Methods in Enzymology, Academic 
Press, San Diego, CA (1989); Innis, et al„, PCR Protocols: A 
Guide to Methods and Applications, Academic Press, San 
Diego, CA (1990); Freshney, R.I., Culture of Animal Cells: 
5 A Manual of Basic Technique, 2 nd Ed., Alan Liss, Inc. New 
York, NY (1987); Murray, E.J., ed. , Gene Transfer and 
Expression Protocols, pp. 109-128, The Humana Press Inc., 
Clifton, NJ and Lewin, B., Genes VI, Oxford University 
Press, New York (1997) . 

10 

Example 1: Synthesis of Cyclic, 8 -Ring Polvamides. 

This example describes the synthesis of several cyclic, 
8 -ring polyamides that bind with high affinity and 
specificity to 6 bp target sites through DNA-polyamide 

15 interactions in the minor groove of dsDNA. To optimize the 
cyclic polyamide motif, at least one chiral linker was used 
to cyclize two polyamide subunits each comprised of 4 -rings. 
Specifically, substitution of the prochiral a-position of 
the y-tum residue to provide (R) -2 , 4 , -diaminobutyric acid 

20 ( (R) H,N y) yielded chiral cyclic polyamides with enhanced DNA- 
binding affinity, sequence specificity, and orientation 
preference. The polyamides described below can be described 
as follows: cyclo- (y-Im-Py-Py-Py- (R) H ' N y-Imz.Py-Py-Py-) (3) 
and cyclo- (y- Im-Py-Py-Py- (R) H ' N y-Py_iPy-Py-Py- ) (4) . Cyclic 

25 polyamides (3) and (4) differ by a single amino acid 

substitution (underlined) , and each recognize a different 
specific 6 bp match site, i.e. , 5--AGTACT-3' and 5 1 -AGTATT- 
3', respectively, which match sites differ from one another 
by a single base pair (underlined) . Thus, the match site 
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for one of these polyamides can serve as a single base pair 
mis-match site for the other polyamide. 

As controls, the binding affinities and sequence 
specificities of the Im-Py-Py-Py-P-Dp (1) unlinked 2:1 dimer 
5 and a hairpin analog lm-Py-Py-Py- (R) H ' N y-Im-Py-Py-Py-C3-0H 
(2) were also studied. An EDTA analogue, cyclo- (y-Im-Py-Py- 

Py- (R) EDTA " Fe(II, y-Py-Py-Py-Py-) (4-E»Fe(II)) was constructed to 
confirm the binding orientation of cyclic polyamide (4) at 
its 5 1 -AGTATT-3 ' match and 5 ' - AGTACT - 3 ' single base pair 

10 mismatch site. All polyamides were synthesized by solid 
phase methods ( see Baird, et al . , J . Am . Chem . Soc . , vol . 
118:6141 (1996), and USSN 08/607,078 for a description of 
solid phase synthesis methods for non-cyclic polyamides) . 
Polyamide purity and identity was confirmed by H NMR, MALDI- 

15 TOF MS, and analytical HPLC. Precise binding site sizes 

were determined by MPE*Fe(II) footprinting ( see Van Dyke, et 
al . , Proc. Natl, Acad. Sci . U.S.A. , vol . 79:5470 (1982), and 
Van Dyke and Dervan, Science , vol . 225:1122 (1984)), and 
binding orientation and stoichiometry confirmed by affinity 

20 cleaving experiments. See Taylor, et al . , Tetrahedron , 

vol. 40 :457 (1984), and Dervan, P. B., Science , vol , 232:464 
(1986) . Equilibrium association constants (K a ) of the 
polyamides for their respective match and mis -match binding 
sites were determined by quantitative DNase I footprint 

25 titration, as described by Brenowitz, et al . , Methods 

Enzymol . , vol . 130:132 (1986), Brenowitz, et al . . Proc . 
Natl. Acad. Sci. U.S.A. , vol 83:8462 (1986), and Senear, et 
al. , Biochemistry , vol . 25:7344 (1986). 
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a. Resin Synthesis 

In order to prepare the cyclic polyamides , including 
those described herein, by solid state methods, a suitable 
resin is required. One such resin, comprising a Py-PAM 
5 ester (5) was prepared according to the published procedures 
of Mitchell, et al . ( J, Org. Chem. , vol 43:2845 (1978)) with 
Boc-Py acid substituted for the standard Boc protected a- 
amino acid ( see Figure 2) . The phenacyl ester (6) was 
selectively cleaved (using Zn and AcOH) , and the resultant 

10 acid activated with DCC and HOBT, followed by reaction of 
the activated ester with an excess of 0.7 mmol/g of 
arninomethylated polystyrene for 24 hours (DIEA, DMF) to 
give Boc-Py-PAM-resin (7). Reactions were stopped at 0.1 
mmol/g substitution as determined by quantitative ninhydrin 

15 analysis of free groups. See Sarin, et al . , Anal . Biochem. , 
vol . 117:147 (1981). Unreacted groups were capped by 
acetylation (AC 2 0, DIEA, DMF) . Picric acid titration ( see 
Gisin, B. F., Anal. Chim. Acta. , vol . 58:248 (1972)) of Py- 
amino groups was used to verify resin loading of 0.1 mmol/g. 

20 b. Synthesis of a Control Hairpin Polyamide 

Im-Py-Py-Py- (R) H/N y - Im-Py-Py-Py-p-Parn resin was 
synthesized by machine assisted protocols in 18 steps from 
commercially available Boc-(3-Ala-PAM resin ( see Figure 
3(a)), according to methods described in Baird, et al . , ±L_ 

25 Am. Chem. Soc , vol. 118:6141 (1996), and USSN 08/607,078. 
The polyamide was cleaved from the resin by a single step 
reduction with lithium borohydride (EtOH, 60°C) , followed by 
reverse phase HPLC purification to yield the desired hairpin 
polyamide, polyamide (2) . 
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c. Cyclic Polyamide Synthesis 

Two polyamide resins, Cbzy- Im-Py-Py-Py- (R) ^^y-Py-Py-Py- 
Py-PAM resin and Cbzy- Im-Py-Py-Py- (R) ^^y-Im-Py-Py-Py-PAM 
resin, were synthesized in 18 steps from the Boc-Py-PAM 
5 resin (600 mg of resin, 0.1 mmol/g of substitution) 
described above using machine-assisted Boc-chemistry 
protocols ( see Figure 3(b)) modified as follows: The (R) - 
2 , 4-diaminobutyric acid residue was introduced as an 
orthogonally protected N-a-Fmoc-N-y-Boc derivative (10) 

10 (HBTU, DIEA) . The final step was introduction of Cbzy-Im 
acid (11) as a dimer block (HBTU, DIEA) . See USSN 
08/607,078. The Fmoc-protected polyamide resins Cbzy-Im-Py- 
Py_Py_ ( R ) Fmoc y _ py - py - py _ Py _ pam resin and Cbzy- Im- Py-Py-Py- 
(R) Fmoc y- Im-Py-Py-Py-PAM resin were treated with 1:4 

15 DMF:Piperidine (22°C, 30 min) to provide Cbzy- Im-Py-Py-Py- (R) 
H ' N y-Py-Py-Py-Py-PAM-resin and Cbzy- Im-Py-Py-Py- (R) H ' N y- Im-Py- 
Py-Py~PAM resin, respectively. The amine resins were then 
treated with Boc-anhydride (DIEA, DMF, 55 °C, 3 0 min) to 
produce Cbzy- Im-Py-Py-Py- (R) Boc y-Py-Py-Py-Py-PAM resin and 

20 Cbzy-Im-Py-Py-Py- (R) Boc y-Im-Py-Py-Py-PAM resin. A single step 
catalytic transfer hydrogenolysis was used to cleave each 
polyamide from the solid support and remove the Cbz 
protecting group from the N-terminal y residue. A sample of 
the resin (240 mg) was treated with palladium acetate (2 ml 

25 DMF, 240 mg Pd(OAc) 2 ' 37°C, 10 min). Ammonium formate was 

then added (500 mg, 8 hr) and the reaction mixture purified 
by reverse phase HPLC to provide H 2 Ny-Im-Py-Py-Py- (R) Boc y-Py- 
Py-Py-Py-COOH (13) and H 2 N-y-Im-Py-Py-Py- (R) Boc y- Im-Py-Py-Py- 
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COOH (12). Cyclization of H 2 N-y- Im-Py-Py-Py- (R) Boc y-Py-Py-Py- 
Py-COOH (13) and H 2 N-y- Im-Py-Py-Py- (R) Boc y- Irrt-Py-Py-Py-COOH 
(12) was achieved with DPPA and potassium carbonate, as 
described previously. See : Cho, et al . , Proc . Natl. Acad. 
5 Sci . U.S.A. , vol . 92:10389 (1995). The Boc-protecting group 
was then removed in situ by treatment, with neat TFA to yield 
the cyclic compounds cyclo- (y-Im-Py-Py-Py- (R) H,N y- Im-Py-Py- 
Py-) (3) and cyclo- (y-Im~Py-Py-Py- (R) H ' N y-Py-Py-Py-Py~ ) (4), 
followed by purification by reverse phase HPLC. The cyclic 
10 polyamides were obtained with similar yield and purity, and 
had similar solubility, as compared to their hairpin 
counterparts . 

d. Binding Site Size Determination 
MPE»Fe(II) footprinting on 3 1 or 5 f - 32 P end- labeled 229 

15 base pair restriction fragments revealed that cyclic 

polyamides (3) and (4), at 10 nM concentration, each bound 
to its designated 6 bp match sites (25 mM HEPES buffer (pH 
7.3), 200 mM NaCl , 50 \xq/ml glycogen, 5 mM DTT, 0.5 |iM 
MPE*Fe(II), and 22°Q ( see Figure 4)). Cyclic polyamide (3), 

20 which contains an Im/Py and a Py/Im pair, was found to 

protect the cognate 5'-AGTACT-3' match site. Binding of 
polyamide (3) to a single base pair mis match site ( e.g. , 
5 ' -AGTATT-3 1 ) was detected only at much higher polyamide 
concentrations. Polyamide (4), which contains a single 

25 Im/Py pair, protected its targeted match site 5 1 -AGTATT-3 1 
and the single base pair mismatch site 5 1 -AGTACT-3 1 . The 
sizes of the asymmetrically 3' -shifted footprint cleavage 
patterns were consistent with 1:1 cyclic polyamide : dsDNA 
complex formation at 6 bp binding sites. 
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e. Binding Orientation Analysis 

Affinity cleavage experiments using cyclo- (y-Im-Py-Py- 
Py- (R) BDTA#Fe<II, y-Py-Py-Py-Py-) (4-E»Fe (II) ) , which contained an 
EDTA»e(II) cleavage moiety appended to the chiral y linker, 
5 were used to confirm binding orientation and stoichiometry 
for the cyclic polyamide. To synthesize the EDTA analogue, 
cyclo- (y-Im-Py-Py-Py- (R) H,N y-Py-Py-Py-Py- ) (4) was treated 
with an excess of EDTA- di anhydride (DMSO/NMP, DIEA, 55°C, 15 
min) and the remaining anhydride was hydrolyzed (0.1 M NaOH, 

10 55°C, 10 min) . Cyclo- (y-Im-Py-PyPy- (R) EDTA y-Py-Py-Py-Py- ) (4- 
E»Fe(II)) was then isolated by reverse phase HPLC ( see 
Figures 6 and 7) . Affinity cleavage experiments were 
performed on the same 3 1 - or 5'- 32 P end-labeled 229 base pair 
DNA restriction fragment from the plasmid pJT8 (20 mM HEPES 

15 buffer (pH 7.3), 200 mM NaCl, 50 fig/ml glycogen, 5 mM DTT, 1 
juM Fe(II), pH 7.0 and 22°C) . The observed cleavage pattern 
for polyamide 4-E»Fe(II) ( see Figures 6(b) and 6(c), are 3'- 
shifted, a result consistent with minor groove occupancy by 
the cyclic polyamide. In the presence of 100 nM 4-E*Fe(II), 

20 a major cleavage locus proximal to the 3' side of the 5'- 
AGTATT-3 1 match sequence was revealed, consistent with 
formation of an oriented 1:1 cyclic polyamide*DNA complex. 
At the same ligand concentration, minor cleavage loci 
located 3' and 5' adjacent to the single base pair mis-match 

25 S'-AGTACTO 1 site appeared, consistent with dual binding 

orientations at this symmetrical binding site. The cyclic 
polyamide binding model was further supported by the 
location of cleavage loci at the 5' side of the 5'-AGTATT-3» 
match site ( see Figure 6(c)), and at the 5' and 3' sides of 
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the 5'-AGTACT-3« mismatch site corresponding to the 
EDTA*Fe (II) moiety placement off the ( (R) H ' N y) linker, 
f . Binding Energetics 

Quantitative DNase I footprint titrations (10 mM 
Tris^HCI, 10 mM KC1, 10 mM MgCl 2 and 5 mM CaCl 2 pH 7.0 and 
22 °C) were performed to determine the equilibrium 
association constants (K a ) of Im-Py-Py-Py-P-Dp (1), Im-Py-Py- 
Py-(R) H ' N Y-Im-Py-Py-Py-OH (2), cyclo- (y- Im- Py-Py- Py- (R) H - N y- 
Im-Py-Py-Py) (3), and cyclo- (y-Im-Py-Py-Py- (R) H ' N y-p y -Py-p y - 
Py-) (4) for the 6 bp match and mis-match sites ( see Figures 
8 and 9) . 

Unlinked polyamide (1) bound its respective match and 
mis-match sites with an apparent first order association 
constant (eq 2, n = 2) , consistent with 2:1 dimer formation. 
Hairpin polyamide (2) and cyclic polyamides (3) and (4) 
bound to their respective match and mis-match sequences with 
binding isotherms (eq 2, n = 1) consistent with binding in a 
1:1 polyamide^DNA complex. Polyamides bound the 5 1 -AGTACT- 
3' site with decreasing affinity, in the following order: 
match cyclic polyamide (3) > match hairpin polyamide (2) > 
mismatch cyclic polyamide (4) > match unlinked, dimeric 
polyamide (1). Polyamides bound the 5 , -AGTATT-3 l target 
sequence with affinities that decreased in the following 
order: match cyclic polyamide (4) > mis-match cyclic 
polyamide (3) > mis-match hairpin polyamide (2) > mis-match 
unlinked, dimeric polyamide (1) . 
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Covalent coupling of unlinked, dimeric polyamide (1) to 
form hairpin polyamide (2) resulted in a 428 -fold increase 
in the dsDNA binding affinity and comparable DNA binding 
sequence specificity. It is interesting to compare hairpin 
5 polyamide (2), Im-Py-Py-Py- (R) H ' N y-Im-Py-Py-Py-OH, to the 
previously reported hairpin Im-Py-Py-Py-y-Im-Py-Py-Py-P-Dp. 
Each hairpin contains eight aromatic rings and a single C- 
terminal charge located either on the y linker or a C- 
terminal P~Dp group. Although hairpin polyamide (2) binds 

10 to DNA with an affinity and specificity comparable to DNA- 

binding proteins, it binds with 4-fold lower affinity and 5- 
lower sequence specificity than the previously described 
hairpin Im-Py-Py-Py-y- Im-Py-Py-Py-p-Dp . This probably 
results from the loss of structure that provides for 

15 favorable interactions between the P~Dp group and A, T rich 

sequences flanking the target site. Since cyclic polyamides 
lack any C- terminal P~Dp group, hairpin polyamide (2) was 
determined to be a more suitable control for the studies 
herein described. 

20 On the basis of the pairing rules for polyamide^DNA 

complexes, the 5 1 -AGTACT-3 1 and 5 1 -AGTATT-3 r target sites 
represented "match" and "single base pair mis -match" sites 
for cyclic polyamide (3) respectively, and single base pair 
mis-match and match sites for cyclic polyamide (4) , 

25 respectively. Cyclic -polymide (3) was determined to bind to 
its match 6 bp target sequence, 5 \ -AGTACT-3 1 , with an 
equilibrium association constant of K a = 7,6 x 10 10 M" 1 , and 
55-fold specificity over the single base pair mis-match site 
5' -AGTATT-3' site <K a = 1.3 x 10 9 M" 1 ) . These affinities 
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represented a 3,600-fold increase relative to the unlinked, 
dimeric polyamide formed by polyamide (1) , and an 8 -fold 
enhancement relative to hairpin polyamide (2) . Furthermore, 
the affinity and specificity of cyclic polyamide (3) were 
comparable to the previously described hairpin Im-Py-Py-Py-y- 
Im-Py-Py-Py-P-Dp. The cyclic polyamide (4), cyclo- (y-Im-Py- 
Py-Py-(R) H,N y-Py-Py-Py-Py-) , which contains a single Im/Py 
pair, preferentially bound its match site, 5 r -AGTATT-3 1 , 
with an affinity of K a = 3 . 1 x 10 9 M" 1 , versus the single base 
pair mis-match site 5'-AGTACT-3 ! (K a = 4.2 x 10 8 M" 1 ) with a 
7- fold preference. Therefore, replacing a single pyrrole 
amino acid in cyclo- (y-Im-Py-Py-Py- (R) H ' N y- Py~ Py-Py-Py) (4) 
with an imidazole residue to make cyclo- (y-Im-Py-Py-Py- (R) 
H ' N y - Im- Py-Py-Py-) (3), regulated cyclic polyamide 
specificity and affinity by 2 orders of magnitude. 

The representative cyclic polyamides described above 
were capped at the N- terminus of each polymer portion in 
order to complete the cycle. However, N-terminal 
acetylation has been found to reduce hairpin polyamide 
binding specificity and orientation preference. See White, 
et al. , J . Am . Chem . Soc . , vol. 119:8756-8765 (1997), and 
Parks, et al . . J. Am . Chem . Soc . . vol . 118:6147-6152 (1996). 
Like hairpin polyamides, cyclic polyamides are capable of 
forming two mirror image folded structures, only one of 
which is responsible for 5 l ->3 1 , N-»C match site binding to 
dsDNA. See White, et al . , J . Am . Chem . Soc . , vol . 119:8756- 
8765 (1997) . While not wishing to be bound to a particular 
theory, it is likely that the chiral amine group in one or 
more of the linkers can help to offset any 'acetylation 
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effect' by controlling cyclic polyamide binding orientation 
preference, and hence binding specificity. It is also clear 
from the above data that the clear from comparison of the 3- 
and 4 -ring cycles, that the detailed rules which have been 
5 developed to guide polyamide design have to be carefully 

considered for successful discovery and evaluation of useful 
Hp-Im-Py polyamide binding motifs. 
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* * * 

One skilled in the art will readily appreciate that the 
present invention is well adapted to carry out the objects 
and obtain the ends and advantages mentioned, as well as 
those inherent, therein. The compounds, methods, 
treatments, procedures, and applications described herein 
are presently representative of preferred embodiments, and 
are exemplary and not intended as limitations on the scope 
of the invention. Changes, modifications, and alternatives 
therein and other uses will occur to those skilled in the 
art upon review of the above description, which changes are 
encompassed within the spirit of the invention and the 
appended claims. 

The invention illustratively described herein suitably 
may be practiced in the absence of any element or elements, 
or limitation or limitations, which is not specifically 
disclosed herein. Thus, for example, in each instance 
herein any of the terms "comprising", "consisting 
essentially of" and "consisting of" may be replaced with 
either of the other two terms. The terms and expressions 
which have been employed are used as terms of description 
and not of limitation, and there is no intention that in the 
use of such terms and expressions of excluding any 
equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are 
possible within the scope of the invention claimed. 

All patents, patent applications, and other 
publications and references mentioned in the specification 
are hereby incorporated by reference in their entirety, and 
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are indicative of the levels of those skilled in the art to 
which the invention pertains. 
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CLAIMS 

WE CLAIM: 

1. A cyclic compound comprising at least seven units, 
one or more of which is a hydrogen bond donor or acceptor, 
which cyclic compound forms nucleotide base pair-specific 
interactions with a double - stranded nucleic acid molecule 
under physiological conditions, but excluding cyclo- (Im-Py- 
Im-Py-y-Im-Py-Im-Py- (G-Dp) -y) , cyclo (Im-Py-Im-Py-y- Im-Py-Im- 
Py- (G-Dp) ) , and cyclo (Im-Py-Py-Py-y-lm-Py-Py-Py- (G-Dp) ) . 

2. A cyclic compound according to claim 1 wherein at 
least one of the hydrogen bond donors is an amino acid. 

3. A cyclic compound according to claim 1 wherein the 
compound comprises a first polymer portion comprising at 
least three hydrogen bond donors, which first polymer 
portion is linked to a second polymer portion comprising at 
least four hydrogen bond donors, 

4. A cyclic compound according to claim 3 wherein at 
least two of the hydrogen bond donors are covalently 
attached by an amide bond. 

5. A cyclic compound according to claim 3 wherein the 
first polymer portion comprises at least four hydrogen bond 
donors . 
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6. A cyclic compound according to claim 5 wherein at 
least one of the hydrogen bond donors is an amino acid. 

7. A cyclic compound according to claim 6 wherein 
5 each of the hydrogen bond donors is an amino acid. 

8 . A cyclic compound according to claim 7 wherein the 
first and second polymer portions comprise equal numbers of 
amino acids. 

10 

9. A cyclic compound according to claim 8 wherein at 
least one of the amino acids of each of the first and second 
polymer portions comprises an aromatic ring. 

15 10. A cyclic compound according to claim 9 wherein 

each aromatic ring has 5 or 6 members. 

11. A cyclic compound according to claim 10 wherein 
the members of each of the aromatic rings are independently 

20 selected from the group consisting of carbon, nitrogen, 
sulfur, and oxygen. 

12 . A cyclic compound according to claim 11 wherein 
the amino acids of the first and second polymer portions are 

25 selected from the group consisting of imidazole, pyrrole, 
hydroxypyrrole, and (3-alanine. 

13. A cyclic compound according to claim 8 wherein 
each hydrogen bond donor of the first polymer portion aligns 

30 with a hydrogen bond donor of the second polymer portion to 



WO 00/40605 



PCf/USOO/00298 



82 



form a donor pair that preferably associates under 
physiological conditions with at least one, but not all, 
nucleotide base pairs selected from the group consisting of 
A/T, T/A, G/C, or C/G. 

14. A cyclic compound according to claim 3 wherein at 
least one of said linkages between the first and second 
polymer portions occurs through a molecule comprising an 
aliphatic chain. 

15. A cyclic compound according to claim 3 wherein the 
linkages between the first and second polymer portions occur 
through two different molecules, each of which comprises an 
aliphatic chain. 

16. A cyclic compound according to claim 15 wherein 
one or both molecules linking the first and second portions 
is aminobutyric acid. 

17. A cyclic compound according to claim 16 wherein 
the molecule is y-aminobutyric acid. 

18. A cyclic compound according to claim 17 wherein 
the y-aminobutyric acid is a substituted derivative. 

19. A compound comprising a cyclic compound according 
to claim 1 covalently associated with at least one 
independent compound . 
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20. A compound according to claim 19 wherein the 
independent compound is selected from the group consisting 
of a protein, a nucleic acid, another cyclic compound 
according to claim 1, a hairpin polyamide molecule, an H-pin 

5 polyamide molecule, extended polyamide molecule, and an 
overlapped polyamide molecule. 

21. A compound according to any of claims 1-20 that 
binds with subnanomolar affinity to a target nucleic acid 

10 sequence in a double- stranded nucleic acid molecule. 



22. A compound according to claim 21 that binds to a 
target nucleotide base pair sequence with a specificity of 

15 at least about 2. 

23. A compound according to claim 21 that associates 
under physiological conditions with a minor groove of a 
double- stranded nucleic acid molecule. 

20 

24 . A cyclic compound comprising a first polymer 
portion comprising at least three hydrogen bond donors, 
which first polymer portion is linked to a second polymer 
portion comprising at least four hydrogen bond donors, 

25 wherein the cyclic compound has a molecular structure that 
forms nucleotide base pair-specific interactions with a 
double -stranded nucleic acid molecule under physiological 
conditions . 
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25. A cyclic compound comprising a first polymer 
portion covalently attached a second polymer portion via two 
linkers, wherein the first polymer portion comprises the 
following chemical structure: 
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wherein a > 0 and b x , b 2 and b 3 > 1, x 1# x 2 , x 3/ and 
x 4 each comprise molecules that act as one or more 
hydrogen bond donors, wherein x lt x 2/ x 3/ and/or x 4 may 
be the same or different molecules, and z Xt z 2t and z 3 
each is a covalent linkage between adjacent molecules 
that act as hydrogen bond donors, and 

wherein the second polymer portion comprises the 
following chemical structure : 



Y - -z>, 



2 - Z\ ~ 
J*2 



Y 3 



Jc 3 



Y 4 - 



Jc 4 



wherein c X/ c 2 , c 3 and c 4 > 1, y 1# y 2 , y 3/ and y 4 
each comprise molecules that act as one or more 
hydrogen bond donors, wherein y lt y 2/ y 3/ and/or y 4 may 
be the same or different, and z' Xi z' 2 , and z' 3 each is 
a covalent linkage between adjacent molecules that act 
as hydrogen bond donors, and 

wherein the first and second polymer portions are 
linked by two linkers extending between the first and 
second polymer portions to produce a cyclic compound 
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having a molecular structure that forms nucleotide base 
pair-specific interactions with a double -stranded 
nucleic acid molecule under physiological conditions. 

5 26. A pharmaceutical composition comprising a 

pharmaceutically acceptable carrier and a compound according 
to any of claims 1-25. 

27. A method of modulating expression of a gene of 
10 interest by administering an amount of a cyclic compound 

according to any of claims 1-25 sufficient to modulate 
expression of the gene of interest. 

28. A method according to claim 27 performed in 
15 vitro. 

29. A method according to claim 27 performed on 
cultured cells. 

20 30. A method according to claim 27 performed in vivo. 

31. A method according to claim 27 performed for a 
therapeutic purpose. 

25 

32. A method according to claim 31 wherein the 
therapeutic purpose is treatment of a disease associated 
with aberrant expression of the gene of interest. 
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33. A method according to claim 31 wherein the 
therapeutic purpose is treatment of a disease associated 
with the presence of a pathogen. 

34. A method according to claim 27 wherein 
administration of the cyclic compound increases expression 
of the gene of interest. 

35. A method according to claim 27 wherein 
administration of the cyclic compound decreases expression 
of the gene of interest . 

36. A cyclic compound comprising a molecular structure 
under physiological conditions that interacts with a binding 
affinity of less than about 100 nM and a specificity of at 
least about two with a specific sequence of at least four 
nucleotide base pairs in a double -stranded nucleic acid. 
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FIGURE 3 
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FIGURE 4A 
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FIGURE 4B 
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FIGURE 7 
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FIGURE 8 
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